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Abstract. It is a theorem of Wyner & Ziv and Ornstein & Weiss that if one observes the initial k 
symbols Xo, . . . ,Xk-i of a typical realization of a finite valued ergodic process with entropy h, the 
waiting time until this sequence appears again in the same realization grows asymptotically like 2 hk 
|7I I12|. A similar result for random fields was obtained in in this case one observes cubes in Z 
instead of initial segments. 

In the present paper we describe generalizations of this. We examine what happens when the set 
of possible return times is restricted: Fix an increasing sequence of sets of possible times {VKn} and 
define Rk to be the first n such that Xo, . . . ,Xk-i recurs at some time in W n - It turns out that 
IWijJ cannot drop below 2 hk asymptotically. We obtain conditions on the sequence {Wn.} which 
ensure that | Wjj fc | is asymptotically equal to 2 hk . 

We consider also recurrence densities of initial blocks and derive a uniform Shannon-McMillan- 
Breiman theorem: Informally, if Uk.n is the density of recurrences of the block Xo,...,Xk-i in 
X- n , . . . ,X„ then Uk,n grows at a rate of 2 hk , uniformly in n. We examine the conditions under 
which this is true when the recurrence times are again restricted to some sequence of sets {W n } 

The above questions are examined in the general context of finite- valued processes parameterized 
by discrete amenable groups. We show that many classes of groups have time-sequences {Wn} along 
which return times and recurrence densities behave as expected. An interesting feature here is that 
this can happen also when the time sequence lies in a small subgroup of the parameter group. 



1. Introduction 

1.1. Background. Let {X n } n ^z be an ergodic process with values in some finite set S, defined on 
a probability space (fl, T , P). Let h denote the entropy of the process. For a realization x = (x n )nez 
of the process and for each k consider the finite word xqX\ . . . Xk-i obtained by viewing x through 
the "window" consisting of coordinates 0,1, ... ,k. The first backward recurrence time of this word 
is defined as 

R7{x) = min{n : n > and x„ n+ j = Xi for all < i < k} 
Set Rfr(x) = oo if the sequence xo^i . . . Xk-\ does not appear again in the past portion of x. 

In ^5]> A. Wyner and J. Ziv proved that ^ log R^ — > h in probability and limsup fc ^fe — ^ 

almost surely. This result was strengthened by D. Ornstein and B. Weiss, who proved in [7j that in 
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fact 

(1.1) lim hogRr(u) = h 

k^oo K 

almost surely. 

Wyner and Ziv's definition of R7 uses the past and future in an asymmetric fashion, but in 
[7] Ornstein and Weiss also define a more symmetric quantity, and prove a related limit result, 
which they later generalized in [S] to the case of Z d processes, or so-called random fields. These 
are families of random variables {X u } u€ %d which are stationary with respect to "time shifts" in Z rf . 
In this setting, the patterns we are trying to match, and the sets in which we look for recurrences 
of patterns, are based on the cubes F n = [— n; n] d (here and throughout, we denote the integer 
segment {i,i + 1 . . . ,j} for i < j by For a realization x = [x u ) u&Ij d of the process, we obtain 

a "pattern" on the cube F^, which we denote x(i^), by coloring each v £ with the color x v . The 
first recurrence time of the -F^-pattern x{Fk) is defined to be the first index n for which we observe 
the pattern x{F^) centered at some point in F n other than the origin. To be precise, 

for some v ^ in [— n; n] d I 
x v+u = x u for every u G F k J 

or Rk{w) = oo if the pattern does not recur. Then according to jHj, with probability 1 

(1-2) hm ^ W ^Rk^)=h 

(in [S] there is also a proof for patterns based on the cubes [0; n] ). 

Our goal in this paper is to describe several generalizations of these phenomena for processes 
indexed by other amenable groups, though many of the results are new even for Z-processes 



i?fc(w) = min < n 



1.2. Notation and preliminaries. Recall that a discrete countable group G is amenable if there 
exists a sequence of finite subsets {F n } of G which are asymptotically invariant, ie for all g £ G, 

lim ' F ", A f»' = 0. 

For example, the sequence {[— n; n] d } c ^'_ 1 in Z rf has this property. Such sequences are called F0lner 
sequences. Asymptotic invariance is equivalent to the following statement: for every finite K C G, 
it holds that 

lim -L|{/ G F n : KfCF n }\ = l. 
n— »oo \r n \ 

There are other equivalent definitions of amenability, some of which have a more analytic flavor, 
but these will not interest us here; see eg 0. 
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For G a discrete amenable group we consider an ergodic process {X g } g£ G taking values in a finite 
set S, defined on a standard probability space (fi, J 7 , P). We will always assume that the process 
arises from an ergodic measure preserving free left action of G on Q, denoted (g,u>) i— > gu>, and from 
a measurable function X : VL — > S such that X 9 {uj) = X(gLj). We denote by /i the entropy of the 
process. 

It is convenient to associate the elements to G f2 with the S-colorings of G they induce: g *— ► 
We will often denote elements of 17 by the letter x to emphasize this point of view, and write x(g) 
for X g (x). For x G f2 and E C G we denote by x(E) the coloring of E by colors from S given by 
<7 1-> x(g), and let [x(.E')] G .T 7 be the atom 

[x(£)] ={x'G!]:V/GB = x(f)}. 

More generally, if cp € S' 5 ' is a coloring of i? by S, we write [<p] for the atom defined by if 

[<p] = {x e Q : Vf e E x(f) = <p(f)} 

and for $ C E f » we set 

[*] = U M- 

It is known that the ergodic and entropy theorems hold along certain F0lner sequences. The 
weakest condition known to ensure this is 

Definition 1.1. A F0lner sequence {F n } in G is tempered if for some constant C independent of 
n, l(Ufc<n F fc _1 ) F nl ^ C \ F n\ for every n. 

(We extend the group operations to sets in the usual manner, so AB = {ab : a G A, b G B}, 
etc). 

Throughout this paper, {F n } will denote a tempered F0mer sequence which also satisfies \F n \ > n. 
It is not hard to see that if G is an infinite group any F0lner sequence in G has a tempered 
subsequence satisfying this. 

For a tempered F0lner sequence, the ergodic theorem states that for every / G L 1 (f2, J 7 , P), 

lim TFT E^)= //dP 

almost everywhere and in L 1 . 

The entropy of a process {Xh} gG G is defined as in the case of Z-processes by 
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where H(-) is the usual entropy function defined for any finite-valued random variable. For a 
discussion of entropy theory of amenable processes see . 

For such processes, the Shannon-McMillan-Breiman (SMB) theorem states that for {F n } a tem- 
pered sequence, 

-LlogP([x(F n )])^h 

for almost every x 6 fL In this generality, these theorems are due to E. Lindenstrauss see also 
PU and |5| . 

1.3. Return times. In the return-time theorems stated above for Z and Z rf processes a single 
F0lner sequence {F n } (segments or cubes) was used both to define the "shape" of the patterns whose 
repetitions we are looking for and as the shape of the domain in which we seek these repetitions. 
There is no reason for the same sequence to play both these roles, and it will be useful to separate 
them. In our formulation of the problem we will consider two sequences of finite subsets of G: {F n } 
(F for F0mer) which will serve as the shape of the patterns, and {W n } (W for window) which will 
serve as the set within which we search for repetitions. Throughout, {F n } will be a tempered F0lner 
sequence satisfying \F n \ > n. The properties we demand of will vary. 

We will be interested in the recurrence of i^-patterns in the windows W n . We say that x{Fk) 
repeats at a point / € G if fx G [^(i^)], ie if fx(g) = x(g) for every g S Fk- If x{F^) repeats at /, 
we say that the repetition is disjoint from Fk (or from x{Fk)) if Fk fl F^f = 0. 

Define 



Rk{x) = R k F,W \x) = min <^n 
or Rk{x) = oo if the set on the right in empty. 



there exists a disjoint 
repetition of x(Fk) in W n 



Remarks. 

(1) Rk(x) is the first "return index" of x to excluding recurrences which intersect the 
original pattern x(Fk). In this respect our definition differs from the definition for the case 
G = TL d from [S] described in the previous section, where it is not required that the repetition 
be disjoint from the original occurrence of the pattern. 

(2) We require only that the "center" of the repetition / to be in W n . We do not require the 
whole of the repetition to be in W n , ie we do not require Fkf Q W n . Note that in the 
known cases of cubes, F n ,W n = [—n;n] d C Z d , requiring that the repetition of x(Fk) be 
completely contained in Wr, does not change the asymptotic behavior of Rk- In general, 
however, the definitions are not equivalent. Indeed it is not necessary that any translate of 
Fk be contained in any W n . 
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(3) In what follows we will be working with several different sequences and then the notation 
R^ will be used to emphasize the dependence on {F n } and {W n }. 

(4) If {F n } is any sequence (not necessarily F0lner) and {W n } is a tempered F0lner sequence 
then the ergodic theorem implies that R k F ' W \x) is finite almost surely. In other cases it is 
not clear that Rk < oo, and for general window sequence this may not be the case. 

Rk is not a very "stable" quantity, and even when finite we cannot hope for its growth rate to 
provide any significant information about the process, since it strongly reflects the indexing of the 
sequence {W n }. For any G-process and choice of {F n } and {Wn}, the value of Rk may be arbitrarily 
increased simply by allowing repetitions in the W n 's, or decreased by thinning the window sequence 
out. A better measure of return time is the volume (=size) of the window set in which the repetition 
is observed. We therefore define 



rp _ T (F,W) _ m | 
ifc — i fc — \ W R (F,W) | 

h 



or Tk = oo if Rk = oo, and set 



T * = T (F,W) = iimsup jrrr lo g T !f' W) 



T* = T^ F ' W ^ = liminf — — logTjf' W \ 

fc^oo 

The aforementioned result for Z rf can be given in terms of since for W n = [— n; n] d , the size of 
Wr u is (2Rk + l) d ; the result then is equivalent to T* = T* = h. 

One half of the Z d return-time theorem is true in the following very general form, which provides 
new information even about Z processes: 

Theorem 1.2. Let G be a countable amenable group and {X g } g£G a finite-valued ergodic G-process. 
If {Fn} is a tempered F0lner sequence in G and {W n } an increasing sequence of finite subsets of G, 
then for almost every x, 

TF' W \x) = liminf j^rlogrf • W \x) > h. 

One might hope that the bound T* < h is also true; if it were, we would have lim Tjr-j log Tf, = h. 
This need not be true for general window sequences. The asymmetry between T* and T*, which 
does not appear in the special cases examined in [S], is not due to some shortcoming of our methods, 
but is inherent in the problem. To begin with, in order for the upper bound to hold, at the very least 
some growth condition must be imposed on {W n }. For example if we set F n = W n = [0; 2l i?n ~ 1 l] C Z 
then |-F n +i| > 2' Fn ' so t^-t logT^ > 1 is always true, even if h < 1. We therefore cannot hope for a 
good upper bound on the growth rate of if we do not restrict the growth of the window sequence 

{W n }. 
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Even if the sequence {Wn} grows slowly, the inequality T* < h need not hold. For a simple 
example, consider the process of alternating sequences of Os and Is with equal probabilities for 
observing or 1 at any given place, so the possible realizations are . . . 0101010 . . . and . . . 1010101 .... 
This process is clearly ergodic and has entropy 0. If we take F n = [—n; n] and W n to be sets 
containing only odd integers, eg W n = [— n, n] n (2Z + 1), then = oo for every realization. In 
this example {W n } is of course not a Folner sequence; in section 12.71 we show that the upper bound 
may fail even when {W n } are "almost" segments and possess extremely good properties which are 
known to be sufficient for the ergodic and SMB theorems to hold. 

In order to state our results about T* we will require to have some additional combinatorial 

properties, which will play a role, in various forms, also in the density theorems stated below. 

Definition 1.3. Let {W n } be some sequence of finite subsets of G. A sequence of right-translates 
{Wn{i)fi}l=i (fi ^ G) is called incremental if 

(1) n(l) > n(2) > ... > n(I). 

(2) ft i Uj^W^fj for 1 < i < I. 

Definition 1.4. Let {W 7 ^} be a sequence of finite subsets of G with Iq € W n . {W n } is filling with 
constant C (or C-filling) if for every incremental sequence {W n u\fi} it holds that | U W n ^fi\ > 
\ W n r{\fi\. A set W is individually C-filling if for every incremental sequence {Wfi} it holds 
that \ UWfi\>CJ2\Wfi\. 

The statement that {W n } is filling means, informally, this: suppose one tries to cover G with 
translates of the W n , using a "greedy" algorithm, that is, putting down some translate of a W n , then 
another (possibly smaller) over a point not yet covered, etc. Note that we allow the different 
translates to intersect. Then if you stop after finitely many steps, the size of what has in fact been 
covered will be proportional to the total of the sizes of the W n 's used up until that stage. 

A fixed set W is filling with constant because in an incremental sequence each point is covered 
at most \W\ times, but generally sequences {W n } containing infinitely many different sets will not 
be filling, as demonstrated by the sequence {[Osn]},^^ in Z (though not filling, this is a tempered 
F0lner sequence). On the other hand one can verify that {[— n; n]}^ ! L 1 is filling in Z. Still another 
example is the sequence {[— n;n] n 2Z} in Z which is filling but not a F0lner sequence. 

From these examples it should be clear that the notion of being filling is rather delicate, and not 
related directly to amenability. In particular, note that while a Folner sequence is in some sense a 
large subset of the group, a filling sequence can be small, eg contained in a proper subgroup. 

Definition 1.5. An increasing sequence {VFn} of finite subsets of G with 1q € W n is quasi-filling 
if each W n is C n -filling and ^ ^p) — < oo for every a > 0. 
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Note that in general filling sequences need not be quasi filling. 

Theorem 1.6. Let G be a countable amenable group and {Xg} g&G a finite-valued ergodic G-process. 
If {F n } is a tempered F0lner sequence in G and {W n } either a filling or a quasi-filling sequence 
such that lim n = 1> then 



1 



T* = lim sup log Tf' W) < h 

k— >oo \-^k\ 

almost surely. 

Corollary 1.7. If {F n },{W n } satisfy the conditions of theorem \l.b\ and \4.Q then 



1 



lim - — -Ios,t!: F ' W ^ = h 



fc^oo \Fk\ 



almost surely. 



It is interesting to note that in [S], the return times theorem was proved also for the case F n = 
W n = [0;n] d C which is not covered by our results, because {[0;ra] d } is not (quasi) filling. We 
will not discuss this here. 

1.4. Recurrence densities. As we have seen, if {W n } grows too swiftly it may not be possible 
for T* < h to hold, because the first W n within which we observe x(Fk) may be too large compared 
to Ff~. To compensate for this, one expects that the number of times the pattern x(Fk) appears in 
W n will grow along with \W n \. 

Another way of looking at it is this: Suppose {F n } , { W n } are sequences such that the return times 
theorem of the last section holds: r^rr \ogT^ F,W ^ — > h. If {Wn} grows slowly enough, then one sees 
from the return time theorem that the density of recurrences of x(F^) in Wr. m—x is approximately 
2~ h \ Fk \ since only the central copy of the pattern exists. Now suppose in addition that {W n } is 
itself a tempered F0lner sequence; since by the SMB theorem the probability of the atom [^(i 7 ^)] 
is approximately 2~ h \ Fk \ the ergodic theorem tells us that for large enough n (depending on k 
and x) the frequency with which x(Fk) repeats in x(W n ) is approximately equal to the probability 
of the atom [x(-Ffe)]. So we see that for large k, the density of recurrences of x(Ff c ) in x(W n ) is 
close to 2~ h \ Fk \ both when n is small (but large enough for the question to be meaningful) and for 
large enough n. However, neither the return-time theorem nor the ergodic theorem give us any 
information about what happens for intermediate values of n. 

We define the recurrence frequency of x(Fk) in x(F n ) to be 



Uk,n{x) = U^^\x) = —— max < 

W n 



\E\ 



1g £ E C W n , the collection 
{Fkf}f£E is pairwise disjoint, 
and x(.Ffc) repeats at / 
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and 

U* = U^ w > = limsup sup - f-L log 

k^oo n \^k\ 



£/, = ur w > = liminf inf-r^-r log(^ n 



fe^oo n \F k \ ^ k ' n \W n \' 

Remarks: 

(1) The quantity U\~ n was denned as the (maximal) density of disjoint repetitions of x(Ff.) in 
W n , one of which is the original pattern. One may attempt to define Uk, n differently, eg 
by counting every repetition or counting every repetition disjoint from the original pattern 
(but not mutually disjoint). These variations are discussed in more detail in sections 13.11 
and PI 

(2) The reason for the correction factor — jpjry m the definition of U* is that our goal is to prove 
U* > h, but in our definition of Iff- n we allowed the "central" copy of x{Fj t ) to be counted. 
This being the case, we cannot expect the observed frequency Uk, n to drop below 2~ h \ Fk \ if 
the size \W n \ is not on the order of 2 h \ Fk]{ . 

As in the case of T* , in order to prove U* = U* = h we need some assumptions about the combi- 
natorial properties of the window sequence {Wn}. For the bound U* < h, it is enough that {W n } 
be filling ( definition II .41 above) : 

Theorem 1.8. Let G be a countable amenable group and {X g } &G a finite-valued ergodic G-process. 
Let {F n } be a tempered F0lner sequence in G. If {W n } is a filling sequence, then 

U* = limsup sup — — — log ui F ' W ^ < h 

fe^oo n | J- fc| 

almost surely. 

See also theorem 13.31 below for a slightly weaker version which applies to quasi-filling sequences. 
For the lower bound £/* > h we need an even stronger notion of "filling up space" : 

Definition 1.9. An increasing sequence {M^} of finite subsets of G with 1q £ W n is said to be 
incompressible with constant C (or C-incompressible) if for any incremental sequence {W n (i)fi} ■> 
the number of the sets W n ^fi containing 1q is at most C. A finite set W C G containing Iq is 
individually C-incompressible if for any incremental sequence {Wfi}, the number of the sets Wfi 
containing \q is at most C. 

Clearly the condition that 1q be in at most C sets is equivalent to requiring that every g £ G is 
contained in at most C sets. It follows that a C-incompressible sequence is ^-filling. 

Any finite set W containing 1^ is |W|-incompressible. Ascending sequences of subgroups, and 
ascending centered cubes in are examples of incompressible sequences (see also section 0). 
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We also need the following: 

Definition 1.10. Let {-Fn}) {W^n} be sequences of finite subsets of G. A sequence {Y n } of finite 
subsets of G is called an interpolation sequence for {F n }, {W n } if 

(1) {Y n } is filling. 

(2) There exists a constant C such that if \W n \ > \Y^\ then lY^^nl < C|Wn|- 

(3) For every pair of real numbers < a < f3, for every large enough n there is an index k such 
that 2 Q I F "I < \Y k \ < 2^I F "I. 

In the case where F n = W n , we say simply that {Yfc} is an interpolation sequence for {F n }. 

Note that {F n } appears only in condition (3) of the definition, and will be satisfied automatically 
(for any sequence {F n }) if lim n ^"^"^ = 1. 

As an example, one may consider the sequence {[— n;n]} of intervals in Z, which is its own 
interpolation sequence. In fact if G contains an element of infinite order, and we identify the 
infinite-cyclic group it generates with Z, then segments in this subgroup can often serve as an 
interpolation sequence for a suitably chosen window sequence in G. We come back to this in section 

run 

Theorem 1.11. Let G be a countable amenable group and {X g } geG a finite-valued ergodic G- 
process. Let {F n } be a tempered F0lner sequence in G. Let {Wn} be an incompressible sequence. If 
there exists an interpolation sequence for {F n }, {W n }, then 



almost surely. 



Since T* > h almost surely, we have U* = liminffc inf n — ■m- log Uk,R k (ie the correction term may 
be dropped), so 



Corollary 1.12. If {F n },{W n } are as in theorem \l.ll\ then 

lim — — !— log U^^^ = h 
k \Fu\ B k ' Rk 



almost surely. 

A slightly weaker condition than incompressibility is the following: 

Definition 1.13. Let {I^n} be an increasing sequence of finite subsets of G containing 1^, and sup- 
pose that each W n is individually C-incompressible (for the same C). {Wn} is quasi-incompressible 
if for every A > and large enough k, for every n > k the Wk boundary of W n is at most a iWfcl - ^ 
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fraction of W n , ie 

-^-{f G W n : W k f % W n } < \W k \-\ 

Note that with this definition, an incompressible sequence may not be quasi-incompressible. 

Theorem 1.14. Let G be a countable amenable group and {X g } a£G a finite-valued ergodic G- 
process. Let {F n } be a tempered F0lner sequence and {W n } a quasi-incompressible sequence in G. 
If there exists an interpolation sequence for {F n }, {W n }, then for almost every x, 

U^ F \x) = liminf inf-^log(l^/) (x ) - JL) > h . 

fc-^oo n \t k \ ' \±< n \ 

Remark. We have not required in any of the theorems that {W n } be a F0lner sequence. In fact one 
may verify that if H < G and { W n } is a sequence of subsets of H satisfying one of the various filling 
properties described above (except for the property of the existence of an interpolation sequence), 
then the property continues to hold when {W n } is viewed as a sequence of subsets of G. This 
means that the recurrence phenomena we have described can often be observed when one counts 
repetitions in a subgroup of G. We will come back to this in more detail in section ITol below. 

The rest of this paper is organized as follows. In section |2] we discuss various ways to construct 
good window sequences and classes of groups in which there exist good window sequences, with 
respect to which the recurrence phenomena hold. We also give an example showing that other 
plausible combinatorial properties of {VK n } do not ensure that the upper bounds for T* and U* 
hold. In section |3] we prove the upper bounds T*,U* < h. In section 21 we prove the lower bound 
T* > h, whose proof has a different flavor than the others, and J7* > h, which depends strongly on 
the lower bound T* > h. Finally in section |S] we state some open questions. 

Acknowledgement. The results in this paper are part of the author's MA thesis, conducted under 
the guidance of Prof. B. Weiss, whom I would like to thank for all his patience and good advice. 

2. Constructions, examples and counterexamples 

In this section we will examine the problem of the existence of good window sets. We consider 
two questions: When do good window sets exist at all, and when do "large" window sets exist. By 
large we mean window sequences which are themselves F0lner sequences. We also give an example 
which shows that, while the filling properties may not be strictly necessary for our theorems to 
hold, other combinatorial conditions on the window sequence, which are known to be sufficient for 
other pointwise theorems, are not sufficient for upper bounds on return times and densities. 
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2.1. General constructions. Before going into concrete examples we examine how good window 
and interpolation sequences can be constructed from other sequences. 

Lemma 2.1. Suppose {W^} is C -incompressible in G' and {W"} is C" -incompressible in G" . Set 
W n = W^x W£. The {W n } is incompressible in G = G' x G" . 

Proof. Clearly {W n } is increasing and 1q € W n for all n. Suppose {W n ^gi\ =1 } is incremental 
and If? G ^Wn{i)9i'i we m ust show that I is bounded. Write gi = (g'^g"). Since 1q> G ^i^n(i)9i 
and \qh £ ^iW"^g" , it suffices to show that there is a C such that if / > C then either there is 
a subsequence ■ ■ ■ ,i(C' + 1)} C {1, ... ,1} with {W^^j^g'^^f^ 1 incremental or there is a 

subsequence {i(l), . . . , i(C" + 1)} C {1, . . . , 1} with incremental. 

We prove this claim by induction on C' + C" . To be precise, we claim that for each pair C', C" 
there is a number a(C',C") such that if {W n ^gi}\ =l is an incremental sequence in G of length 
/ > a(C, C") then either its projection on the first coordinate has an incremental subsequence of 
length greater than C 1 or its projection on the second coordinate has an incremental subsequence 
of length C". Clearly 

a(C', 1) = C' a(l, C") = C" a(l, 1) = 1. 
For the induction step, let { W n ^gi} I ._ l be incremental in G. Write 

Ui = {l< i < I : g'ii W' n{l) g' x ) , U 2 = {l< i < I : g'l £ W^g'i] 

clearly Ui U U 2 = {2, ...,/}. If j[/i| > a{C' - 1,C") or \U 2 \ > a(C',C" - 1) then {W n(i) gi} has 
a subsequence of the type we desire. But one of these relations will certainly hold if \Ui U U 2 \ > 
a(C' - 1, C") + a(C', C" - 1), so we can set 

a(C', C") = a{C' - 1, C") + a(C', C" - 1) + 1. □ 

The point here is that we obtain a "large" incompressible sequence in G' x G" . "Small" sequences 
exist since an incompressible sequence in G' (or G") is incompressible in G' x G" with the natural 
embedding. Note that if {W^} , {W"} are F0lner sequences in G', G" respectively then {W' n x W"} 
is F0lner in G' x G" . 

In ^U] it was shown how certain properties of F0lner sequences can be pulled up through exact 
sequences. We rematk that we do not know whether the existence of filling or incompressible F0lner 
sequences in K and G/K for a normal subgroup K of G implies that such sequences exist in G, 
although it seems unlikely that this is the case (again, the sequence in K has the same good filling 
properties in G, but is not F0lner). 

The requirement that {VFn} be C-filling (or C-incompressible) is a property of the entire sequence, 
and in particular implies that each W n is C-filling (C-incompressible) as an individual set. The 
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existence of a constant C such that each W n is C-filling (C-incompressible) does not imply that the 
same about the sequence. One can, however, sometimes pass to a subsequence of the {W n } which 
will be filling. 



Lemma 2.2. Suppose {W n } is an increasing sequence of finite subsets of G with 1q £ W n such 
that 



(1) {Wn} is a F0lner sequence. 

(2) For some constant C, each W n is individually C-filling. 
Then there exists a subsequence {W n {i)} which is filling. 



Proof. Since {W n } is a F0lner sequence, we may select a subsequence {W n (j)} with the property 
that 

tt^-t \{g € W n(l) : W n{i _ 1)9 C W n{i) }\ > -. 
\'"n(i)\ ^ 

Write W[ = W n ^y, we claim that {W(} is filling. Suppose that {W!^fj}j =1 is an incremental se- 
quence. Let 1(1) > . . . > I(N) be all the values of . . . , i(J), and let = {1 < j < J : = k}. 
Since Wj^ is C-filling, we have 

I |J W/(i)/,|>C'^|W /l / j |. 
jeJi jeJi 



Let 



then 



so 



Ei = {g € |J W m fj : W I(2) g C |J W m fj} 
jeJi jeJi 

\Bk\ > | |J W m fj\ - \{g € (J Wj^)/,- : W /(2)5 £ J 
j'eJi jeJi jeJi 

> cEi^(i)/ii-E^d)/ii 

jeJi jeJi 

= §E 

I U ^d)/i \ U wioo/il ^ 1^1 ^ 7 E l^i/il 

jeJi ieJ 2 u...uJ]v jeJi 
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which means that the fraction of UjeJi ^1(1) fj which is n °t contained in any of the "lower levels" 
is at least ^ SjgJi Repeating this argument for 1(1), ■ ■ ■ , I(N) we obtain 

I U w mfi\ * j E \ w ^\- D 

The following is proved similarly: 

Lemma 2.3. Suppose {W n } is an increasing sequence of finite subsets of G with 1q £ W n such 
that 

(1) {Vt^n} is a F0lner sequence. 

(2) For some constant C, each W n is C -incompressible. 

Then there exists a subsequence {W n ^} which is quasi-incompressible . 

The last construction in this section deals with the existence of interpolation sequences. 

Lemma 2.4. Let {F n } be an increasing tempered F0lner sequence. Suppose {Y n } is increasing and 
filling. Then there exists a subsequence {F n u\} of {F n } such that the sequence Y n is an interpolation 
sequence for {F n ^}. 

For the sequence {Y n } to be useful, {F n } must fulfill the hypothesis of theorem 11.111 and so {F n } 
must also be incompressible or quasi-incompressible; but this is not necessary for the proof of the 
lemma. 

Proof. Let {[a<i,A']}£i he an enumeration of the rational intervals, < on < /3j. We select F n r k \ 
and sets Zi t k = Y m (i,k) (1 < i < k) inductively in k, as follows: Given k and assuming that we have 
defined F n (j\ and Zij for every 1 < i < j < k, we select F n ^ and for 1 < i < k so that 

(a) Z^ k satisfies 2 ai|F ™( fc - 1 > 1 < \Z ijk \ < 2 ft|F "( fc - 1 ) 1 . 

(b) F n r k \ is large enough that for alH < k it holds that \F n ^ \ < 2 ai l F ™( fc )L 

( c ) \({Ji<k-l Z i,k-l) F n(k)\ <2|-F n ( fc )|. 

(d) n(k) is large enough so that for 1 < i < k and every n > n(k) there exists m such that 

2 a i\ F n\ <- \Y m \ < 2^'^ n 

One then verifies that {Z^} is an interpolation sequence for {F n ^}. Briefly, (a) ensures that 
(3) of the definition of interpolation sequences holds, while (b) and (c) ensure condition (2), and 
(d) makes it possible to continue the construction ((a) is possible for k because for j < k we chose 
W n u\ to satisfy (d) ). □ 

2.2. The groups Z and Z d . In the case of Z, any increasing sequence of symmetric intervals {I n } 
is 2-incompressible. This is simple to verify. 
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Next, Z d is the product of Z with itself d times, we know from lemma l2~Tl that incompressible 
sequences exist. In fact, 

Proposition 2.5. If {I n } < £L\ are increasing sequences of symmetric intervals, i = then 
the sets W n = In x . . . x 1^ form an 2 d -incompressible sequence. 

Lemma [2,l| gives a poorer estimate for the constant. That the constant 2 d is correct (and optimal) 
for such sequences can be established directly. 

We will call sets of the form in the proposition boxes. If {W n } is an increasing symmetric sequence 
of boxes, then it is easy to see that for any n we can find boxes W n = Y n $ C . . . C Y n u n \ = W n+ \ 
such that < 2. Ordering all the Y n j in a single increasing sequence {1^}, we obtain 

a filling sequence which grows at most exponentially. It is easy to verify that \W n + Y m \ < 2 d \W n \ 
whenever \W n \ > \Y m \; so {Y m } is an interpolation sequence for {W n }. This gives 

Proposition 2.6. For G = 7L d , any increasing sequence {W n } of boxes and any tempered F0lner se- 
quence {F n } (and in particular F n = W n ), T* F ' W ^ > h andll!; F ' W ^ = U* FW ^ = h. 7/lim n ^g^"^ = 

1 then T^ W) = T* F<W) = h. 

This is essentially a reformulation of the results from 8 . 

One can show that certain increasing sequences of symmetric convex sets in Z d are incompressible 
as well. 

2.3. The group Z°°. Consider the group Z°° = 0~ =1 Z. Identify the subgroups Z d x {(0, . . .)} C 
Z°° with Z d in the obvious way. 

Proposition 2.7. Let A C 7L d have full dimension (ie (A) is of abelian rank d). If A is C- 
incompressible then C > d. 

Proof. For a set A C Q d , we say that u € Q d is an extreme point of A if it is an extreme point of 
convqjA 

We prove the following: If A C Q d is finite and has dim (A) = d, and is an extreme point of 
A, then there exists a set {u\, . . . , C —A such that {^4 + Ui} d ^ is incremental. In this case, 

clearly G C\(A + Uj), so A cannot be C-incompressible for C < d. 

The proof is by induction on d. For d = 1 the claim is trivial: take a\ = and a% E A \ {0}. 

Now suppose we have proved it for d — 1. Let A C Q d be as in the claim. Set E = extconv A, 
so by assumption £ E. Since A is d-dimensional, so is E. Select a linearly independent set 
oi, . . . , ad-i G E \ {0}. Write V = span^, {a±, . . . , a^-i}, and assume {ai} was selected so that A is 
on one side of the subspace V, ie there is a linear functional A with A(^4) C Q+ and V = ker A. 
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Consider A' = A n V. This is a finite set of full dimension in V containing as an extreme point. 
Therefore there exist ui,...,Ud € —A' with {A' + Ui}^ =1 incremental. Clearly {^4 + «j}f =1 is also 
incremental. Now we need only note that for any u € A\A', we have that —u ^ uf =1 A + Ui because 
A(u) < whereas A(a + Ui) = A(a) + > for all a G A and i = 1, . . . , d. Thus {m, U2, • • • , Ud, —u} 
is the set we are looking for. This completes the induction step. 

The proposition now follows from the fact that it holds for A iff it holds for every translate of A, 
and we can always translate A so that is an extreme point. □ 

Corollary 2.8. There are no incompressible F0lner sequences in Z°°. 

Proof. If {F n } is a F0mer sequence in Z°° then for any d, F n must be eventually d-dimensional. □ 

It seems likely that there is a similar bound on how filling a ci-dimensional set in Z d can be, and 
this would imply that there are no filling F0lner sequences in Z°°, but we do not have a proof. 

On the bright side, there do exist quasi- filling F0lner sequences in Z°°, as we now demonstrate. 

Suppose 1$ C Z are symmetric segments and W n = x . . . x I^ n ^ C Z°° is an increasing 
sequence of finite-dimensional boxes. We know that each set W n is 2 ~ d ( n ') -filling in Z d (") and it 
follows that the same is true as subsets of Z°°. In order for {W n } to be quasi filling, we must have 

^2 d (") • \W n \~ a < oo 

n 

for every a > 0. For this it is enough that {W n } grow exponentially and d(n) = 0(log |W„|). We 
would also like {W n } to be a tempered F0mer sequence. Such a sequence can easily be constructed. 
For example, we may take 

W n = [_2™ 2 ;2 n2 ] [loSn] . 

{VF n } is then an increasing quasi-filling F0mer sequence. For temperedness it suffices to check that 
IW^Wnl < C\W n \. We have 

W n-l W n = I" 2 "' - 2 (ri " 1)2 ,2™ 2 + 2( n - 1 ) 2 ][ 1 °S n l 

so 

\W~\W n \ < / 2 -(2" 2 +-2("- 1 ) 2 ) + l 
\W n \ ~y ^ 2 • 2 n2 + 1 

< (l + 2 - 2 «+ 2 ) [logn] _ 1. 

One also readily verifies that {W n } satisfies ^fgjpp^ ■ —> 1- We therefore have 

Proposition 2.9. There exists a tempered F0lner sequence {W n } in Z°° such that for any tempered 
sequence {F n } (and in particular F n = W n ) we have T* F ' W ^ = TT pw ^ = h. 
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2.4. Locally finite groups. G is locally finite if every finitely generated subgroup is finite. 
Since we only consider countable groups this is equivalent to saying that there are finite subgroups 
G± < G*2 < . . . whose union is all of G. 

It is easy to see that such a sequence {G n }%Li is 1-incompressible (and hence 1-filling). Translates 
of GiS are just cosets, so if / ^ Gig then Gif n Gig = 0. Thus if {G n ^gi}f =1 is incremental, the 
G n u\gi are pairwise disjoint, so every f € G belongs to at most one translate. Such a sequence 
{G n } is also clearly a tempered F0lner sequence, since every g G G is eventually a member of G n 
for large enough n. 

If in addition {Gi} grows slowly enough, then {Gi} is its own interpolation sequence. Thus 

Proposition 2.10. If G\ < G2 < ■ ■ ■ and G = UGi, then for = Wk = Gk we have that T* > h 
and U* < h. If in addition 1 ° g|( f'+ l1 -» 1 then T* = T* = h and U* = U* = h. 

— J log|&»| 

In the case where {Gi} grows too quickly to be its own interpolation sequence, we can still show 
that for a suitably chosen subsequence (G^)} there exists an interpolation sequence. This is based 
on 

Theorem. (P. Hall and C. R. Kulatilaka, pQj Every infinite locally finite group has an infinite 
abelian subgroup 

To use the theorem we need 

Proposition 2.11. An infinite locally finite abelian group A possesses an increasing sequence of 
finite subsets (Y n )^ =1 each of which is indivdually ^-incompressible and < \Y n \ < 2\Y n -i\. 

Proof. We can find finite subgroups 1 < ... < A n < ...A such that A n+ \/A n is cyclic. Thus 
A n+ i = A' n+l © C n+ i and A n = A' n+1 © k n C n+ i for C n+ i some cyclic group and k n G N. Let Z U)i 
be a 2-incompressible sequence in C n+ \; then Y n ^ = A n U {A' n+1 x Z n ^i) is 4-incompressible, and 
ordering all the Y n> i by inclusion gives the desired sequence. □ 

Now if G is locally finite, and A < G an infinite abelian group, we can find a slowly-growing 
sequence (Y n ) of finite subsets of A each of which is 4-incompressible. Thus (Y n ) enjoys the same 
properties as subsets of G, and we may apply lemma l2~H to obtain 

Proposition 2.12. For any locally finite group G = U^ =1 G n there exists a sequence G n ^ such that 
for F, t = G n{{) , 

Tj( F ' F ) — TJ* — h 

u * — (F,F) — 

We remark that in order to obtain an interpolation sequence in G one does not need the full force 
of the Hall-Kulatilaka theorem, but can rather use the fact that for every N there is a K such that 
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every finite group H with \H\ > K has an abelian subgroup of size at least 7V, and therefore infinite 
locally finite groups have arbitrarily large abelian subgroups. This can be proved by elementary 
methods. 

2.5. Groups with polynomial growth. Let G be a finitely generated group and r C G a finite 
set of generators. For convenience we assume throughout that T is symmetric, ie that F = T^ 1 . For 
every element g G G define the length of g (with respect to T) by 

£ r (g) = mm{k : g € T k } 

so £r{g) is the least number of elements of T which need be multiplied together to give g. 
It is easy to check that 

dr{gi,92) = MffV) 

defines a left invariant metric on G (this is just the shortest distance metric on G"s Cayley graph). 
The ball of radius n in this metric centered at the unit element of G is 

B v n ={geG : £ r (g) < n}. 

The growth function of G (with respect to T) is defined by 

lv {n) = \B T n \ = #{geG : £ r (g) < n}. 

A finitely generated group G is said to have polynomial growth if its growth function 7 r is 
bounded by a polynomial, and subexponential growth if ^log7 r (n) — > 0. The property of having 
polynomial/subexponential growth is really a property of the group and does not depend on the 
particular generating set T. To see this, let ri,r 2 be two finite generating sets for G as above. 
Then there is an TV for which Ti C (T 2 ) N , so (Fi) n C {T 2 ) Nn . This means that 

and so 

7 ri ( n ) < lr 2 ( Nn )- 

From this, and the fact that the roles of Ti and T 2 are interchangeable, it follows that 7 r is 
bounded from above by a polynomial p(x) of degree d iff 7 ra is also (though not necessarily the 
same polynomial), and similarly ^ log 7 r (n) —>■ iff ^ log 7 ra (n) — > 0. 

Though formally finite groups have polynomial growth, we will assume from here on that our 
groups are infinite. We also assume that T is a fixed finite symmetric generating set for G and write 
7 for 7 r and B n for B^. 

One can show using elementary methods that groups with subexponential growth are amenable. 
Indeed, 
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Proposition 2.13. There is a subset I C PJ of density 1 such that {Bi}i e j is a F0lner sequence. 

Proof. For suppose that 7 grows subexponentially. We will find a subsequence {B nk }^ =1 of balls 
which is a F0lner sequence. 

Write a(n) = 2iS±*J an d for e > set I £ = {n G N : a(n) > 1 + e}. Since 7 grows subexponen- 
tially, for every e > 0, I £ must have zero density; ie 

lim : -\I e n [0;n]| = 0. 

ra-»oo n + 1 

Choose JVfc growing very rapidly such that 

^-^|/ lA n[0;iV fc ]| < 1/k. 

And define I = Uk(I\/k D [iVfc_i; iVfc]). Since if n is such that a(n) > 1 — ^ then gi? n C B n+ \ and 

2 

|gS n A-B n | < \B n+ i \gB n \ + |.B n+ i \-B n | < — |-B n |- 

k 

We see that (B n ) n& i is F0lner, and clearly I has density 1. □ 

We will need the following stronger fact: 

Theorem. and |3] ) If G = (T) has polynomial growth then there is an integer d and constants 
c\,C2Such that c\n d < 7r(^) < c 2 n d . 

This implies that 

\B- l B n \ = \B\\ < c 2 (2n) d < —\B n \ 

ci 

so the F0lner sequence from the proposition is in fact tempered. 

We also conclude that every ball B n is 4 d c 2 /c\ incompressible. For suppose that {B n gi}f =1 is 
incremental and 1q G V\iB n gi. Then {B n / 2 gi} is a disjoint collection contained in B 2n gi so 

N 

N ■ Cl (^) d < N ■ ln/2 = ^ 7n/2 < 72n < c 2 (2n) d 



and therefore N < — 4 d . 

— Cl 

We thus get 

Proposition 2.14. IfG has polynomial growth there is a F0lner sequence {F n } such that r^rj log ' 
h almost surely. 

Using the construction from 12.21 and 12.41 we also get 

Proposition 2.15. If G has polynomial growth there is a F0lner sequence {F n } such that U^ FF ^ = 
lli F ' F ^ = h almost surely. 



Return times, recurrence densities and entropy 



19 



We also note that by Gromov's theorem [3] every infinite group G with polynomial growth has 
an element of inifinite order, so the next section applies. 

2.6. Recurrence in subgroups. It is simple to verify that all the various filling-type properties 
described so far are preserved in moving from subgroup to supergroup. This immediately gives 

Proposition 2.16. If H < G and {Wn\ is a filling or quasi-filling sequence in H then for any 
F0lner sequence {F n } in G, U* FW s < h. If in addition — > 1 the T* F ' W ^ = T^ FW ^ = h. 

To obtain the same result for £/*, more care is needed, because one cannot automatically transfer 
an interpolation sequence from a subgroup to supergroup; this is due to the fact that condition 
(3) of the definition of interpolation sequences (definition ll.lUjl depends on the F0lner sequence 
involved, and not only on the window sequence, and the F0lner sequences in the supergroup are 
quite different from those in the subgroup. However, if {Y n } satisfies (1) and (2) of the definition 
of interpolation sequences with respect some to and in addition lim^^oo ^"^j?"^ = 1, then 

(3) of the definition of interpolation sequences is satisfied for any sequence {F n }, and so {Y n } is an 
interpolation sequence in G for {F n }, {W n } for any sequence {F n } in G. This gives 

Proposition 2.17. If H < G and {W n } is an increasing incompressible sequence in H, and there 
exists a sequence {Y n } satisfying (1) and (2) of definition M.im and such that ^^^j 1 — > 1, then 

for any F0lner sequence {F n } in G we have U^ F ' W ^* = lli F ' W ^ = h. 

Corollary 2.18. If G contains an element of infinite order then there exist in G window sequences 
{Wn} for which for any tempered F0lner sequence {F n }, U* = JJ^ F,W ^* = T* F ' W ^ = T^ F,W ^* = h. 

Proof. If (g) = Z take W n = [— n;n] C (g); since {lf n } is incompressible in Z the same is true in 
G, and it is its own interpolation sequence. □ 

We remark that in the case that {W n } is contained in some proper subgroup H of G, although 
we are looking at recurrence in H, the patterns we are looking at come from a F0lner sequence in G, 
which is not contained in H. We emphasize that these results do not follow from an application of 
our recurrence theorems to the -ff-process which arises from the restriction of the G-action to H; in 
fact the H -process derived from an ergodic G-process need not be ergodic, and even if it is it need 
not have the same entropy as the original process. This observation means that we cannot weaken 
the requirement that F n be a F0lner sequence. If for example {F n } were a F0lner sequence in some 
proper subgroup H of G, our results as applied to the -ff-process and the ergodic component to 
which x belongs would show that U* etc. do converge, but their value depends on x and may 
be different from the entropy of the original process. 
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2.7. Return times and densities may misbehave for "nice" window sequences which 
aren't filling. It is natural to wonder which of the theorems about return times and densities are 
true under weaker conditions than those stated, and in particular whether the requirement that 
{Wn} be filling may not be replaced with a weaker condition. One condition which has been used 
successfully in the proof of other pointwise theorems is that of being a Templeman sequence, ie an 
increasing F0mer sequence with (W^W^ < C|W n | (note that this is implies temperedness of the 
sequence {W„}). In this section we show that even a stronger condition is not enough, and that 
the almost-certain bound 

lim inf inf — — — log u[ F ' F ^ (x) <h 
k n \Fk\ 

may fail even if {F n } is a symmetric, increasing Templeman sequence of all orders, ie for all d there 
is a constant such that |(-Fn) d | < Ca\F n \ for all n. 

Let S 1 = {z G C : \z\ = 1} be the unit circle in the complex plane and let Tz = e 2nt9 z be an 
irrational rotation, ie 9 G R \ Q. Let X : S 1 -> {0, 1} be the map z i— > sgnlmz, and consider the 
process X n = T n X defined on S l when S l is equipped with Lebesgue measure. It is well known that 
this process is ergodic and has zero entropy. Our goal here is to construct a sequence of increasing, 
symmetric finite sets F n C Z such that {F n } is Templeman of all orders and \F n \ = 2n, but for 
which there is a sequence of indices ki such that T^ F ' F \z) > 2' Ffc <' for every z G S 1 , so t^t logT^'^ 
does not converge even in probability to the entropy. This implies that the asymptotic upper bound 
for inf n — log ujf^ fails as well. 

Let e n \ be a decreasing sequence of positive numbers such that for every z G S 1 , if < \k\ < n 
then \T k z — z\ > e n . For any zq G S 1 one can find such an e n , since T has no periodic orbits. Since 
T is an isometry of S 1 such a choice of e n is good for all z G S l if it is good for zq. 

Let £ n be an increasing sequence of positive integers such that any realization (X_£ n (z), . . . , Xp n (z)) 
determines z up to a distance of e n . More precisely, we require that if z, z' G S 1 satisfy X^{z) = 
Xk(z') for all — n < k < n then \z — z'\ < e n . Such l n exist because if we denote by V the partition 
of 5 1 determined by X then \Z e _fT n V partitions S 1 into segments whose length tends to as i — > 0. 

Finally, write 

J n = jfc G Z : \z- T k z\ > e n (for any or all z G S" 1 )} . 

Note that J n are symmetric (because T is an isometry) and that since e n are decreasing, J n C J n+ \. 
Also note that if I = [a; b] C Z then 

k,m £ I \ J n => |z — T fc ~ m z| = |T fe z - T m z| < 2e„ 

and so if e n i > 2e n then n' < \k — m\. Thus if |/| < n' then \ J n P\ I\ > \I\ — 1. It follows that the 
relative density of J n in any segment / of length n' — 1 is at least 1 — -rrr . 
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We define the sets F n inductively. At each stage we obtain F n from F n _i by appending to F n _i 
some symmetric pair of numbers ±m. Given F n and a symmetric set £CZ, when we will say that 
we "add E to F n " we will mean that over several stages we add to Fi the smallest pair of numbers 
(in absolute value) in E \ Fi until E is exhausted or some condition is met. 

We start with Fq = and add to it the segment [— £i",£i] to obtain the sequence Fi, . . . , F^ . We 
now add J\ until we obtain a set F ni such that |.F ni | > 2^1'. Now we add [— ^2^2] to F ni and 
obtain F^ 2 \ then add J2 to Fk 2 until we obtain a set F n2 with \F n2 \ > 2^ Fk 2^. We proceed in this 
manner, alternately adding to obtain F^ and then Jj to obtain F ni with \F ni \ > 2^ Fk ^. 

The sequence {F n } thus constructed is clearly increasing and symmetric, and satisfies \F n \ = 2n. 
Next, we claim that T^. (z) > 2' Fk i* for every z £ S 1 . To see that this is true for all i, note that 
z{Fj ti ) determines z up to £j because [— £i",li] Q F^ , while for every m £ F n . \ F^. we have m € Jj, 
so \z - T m z\ > Si. Therefore T m z([-^;^]) / z([-£f,£i]) and this forces T m z(F kt ) ^ z(F h ). Since 
\F ni \ > 2^1, this proves the claim. 

As for \F n \ being Templeman, it suffices to show that linin-^oo — — J . P = 1; this implies 

J ' rn<ix 71 niiri J^fi 

both that {F n } is a F0lner sequence, and that \F^\ < Cd\F n \ for some constant C^, for all n. But 
this follows from our remarks about the relative density of J n in certain segments. 

We remark that it seems to be more difficult to construct an example of such a sequence {F n } for 
which the lower bound for Uk, n fails. Perhaps the lower bound U* > h holds more generally than 
the upper bound, as happens in the case of T^. we have so far been unable to determine whether 
this is so. 

3. Upper bounds for return times and densities 

In this section we prove the upper bounds T* < h and U* < h as described in the introduction. 

3.1. The upper bound for U*. We begin with some further remarks about the definition of 
U}. n . Recall that in section HTH we defined Uf. n to be size of the maximal collection of repetitions of 
x(Fk) in x(W n ) satisfying (a) the collection includes the trivial repetition x(Fk) itself, and (b) the 
members of the collection are pairwise disjoint. If we drop these restriction we obtain the quantity 

V k , n (x) = V^ W \x) = t^t-M € W n : x{F k ) repeats at /} 

I VVfi I 

which is related to Uk, n by 

(3.1) Uk, n < Vk,n < \F k 1 F k \Uk, n < Wk^Uk^n- 

The middle inequality is due to the fact that from any collection of right translates of F^ we can 

obtain a pairwise disjoint sub-collection at least - — =t — - the size of the original collection which 

\F h F k \ 

includes any single prescribed member of the original collection (which in our case we take to be 
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the set F)., representing the original pattern). From this relation we see that in order to prove that 
U* < h, it suffices to prove that V* = limsup n sup fc — m-j log Vfc jn < h. 

The key lemma which makes filling sequences special is the following, which states that if a set 
E has a low "local" density with respect to {W n } then it has a low "global" density: 

Lemma 3.1. Suppose {W n } is a C -filling sequence and E C G finite. If for each f £ E there is 
an index n{f) such that — r|W / n(/)/ H E\ < a then 

I n ( / ) I 

\E\ a 

< 



\Ufe E Wn(f)f \ -C 
Note that we always have < C < 1. 

Proof. Order E by decreasing value of ro(-): E = •••,//} and write n(/j) = n(i). We select 
a subsequence C . . . , //} inductively. Set i(l) = 1 and let i(J) be the first index such 

that ^ Ujfc<j ^n(fe)/fcl the process ends after J steps when E C U/=i Wn(i(j'))/i(j) (recall that 
1g G W n for each n, since {W^} is filling). Clearly {Wn^))/^')}^! is incremental. Now 

J J j 

\E\ < Y, \W n {i(j))fi(j) n E \ ^ a Y,\ W nW))\ ^ U 
3=1 3=1 3=1 

(the middle inequality is because by hypothesis \W n u\fi C\E\ < a\W n u\\ for every i = 1, ... , J and 
the last because {Wn} is C-filling). Since 

J / 

i=l i=l 

the lemma follows. □ 

We apply lemma ETTl in the following situation: Consider an ergodic G-process defined on ($7, J 7 , P) 
with entropy h. Suppose x € Q, k < L « N are fixed, and tp : Fj. — > E is some F k pattern. Let E 
be the set of repetitions of (p in x(Fn) with the additional property that near them cp repeats not 
frequently enough with respect to Vfc jn : 



E={feF N 



fx € [<p] and there exists n{f) < L I 
s.t. V kMf) (fx)<2-( h +W*\ J ' 

Since for any f E E we have /cc G [(/?] and j^ 1 | [_E? n W n (j)/| < 2~^ l+e ^ Ffc l, from the lemma we see 
that 

1 \e\ < 1 . 2-^)1^-1 

\u feE w n{f) fy 1 c 
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If we assume that N is large enough so that |(U n <£,W 7l )i'jv| < 2|F/v| (this happens eventually since 
{F n } is a Folner sequence), then since U/ e £;Wn(/)/ C (U„<i,W n )Fjv, we see that 

' \E\ < 2 \E\ C 2 2- (h+£)m 

1 \ U feE W n{f) f\ l 1 C 



Given some <E> C and repeating this argument for every i^efwe have 



l^| 



#<feF N 



fx <G [<&] and there exists n(f) < L 

S-t. n.,n(/)(/x) < 2-^)1^1 



< _ . 2 -( A + £ )l^l . |$|. 

C 



Theorem 3.2. If {F n } is a tempered F0lner sequence and if {W n } is filling, then 



U*(x) = limsup sup — — -r log ujf^^ (x) < h 

k— >oo n \^k\ 



almost surely. 



Proof. It is enough to prove that 

V* = limsup sup -— — log \V ktU \ < h 
k n \rk\ 

almost surely. Suppose to the contrary that for some e > and measurable non-null set B C Q, for 
every x £ B 

lim sup sup - — — log Vu F ' W ^ (x) > h + e. 

fc^oo n \F k \ 

Let P{B) > p > 0. Then from the SMB theorem, there is a measurable subset Bq C £>, P(B$) > p, 
a sequence {^fe} of sets C S^ fe and integer Li such that for k > L± it holds that < 

may further be assumed that for some L2 >> L\, 
for any x £ Bq there are indices k(x),n(x) with L\ < k(x) < L2 and n(x) < L2 such that 

V k{x)Mx) (x)<2-( h +^ F H*)\. 

Choose N very large with respect to L2 so that Fn is very Wl 2 -invariant, and large enough so 
that there is an x <G ft such that 

-!-#{/ eF N : fx£B }>p 

(by the ergodic theorem such x's are bound to exist for iV large enough). 

Fix x G for which the above holds. For / € Fjv such that fx G i?o, write k(f) = k(fx), 
n(f) = n(fx). Define 

E = {f £ F N : fx £ B }, 
E k = {f £ F N : fx £ B and k(f) = k}. 
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Clearly E C (U^^-Efc), so by our choice of x we should have 
On the other hand, according to the discussion after lemma UTTl 



— \E k \ < -2-^+^11$ J < l(2- £ / 2 )l^l 
|Fjv|' fc| " C ~ C ' 

so that 

(3.2) ^ |UE * I = ^ E l^<C<.(2-/Y'. 

for some constant C" depending only on e, so if Li was chosen large enough with respect to e (recall 
our assumption that that \F n \ >n), this is less than p, a contradiction. □ 

Esssencially the same proof gives us 



Theorem 3.3. Let {F n } be a tempered F0lner sequence in G. If {W n } is a sequence such that W, 
is C„- filling and {n 
for almost every x, 



is C n -filling and {n(k)} is a sequence of integers such that YlkLl 2 c ~ ^ 00 f or ever V « > 0, then 



limsup _ log[/ (W( x) < h _ 

The proof uses the fact that lemma has an analogue when {IVn} is replaced with a single 
C-filling set W, namely that if for a finite subset E C G we have ^^j^ < a for every / £ E then 

^ —7* (to see this, apply the lemma with W n = W ). Using this the proof now follows exactly 



\WE\ - C 

the lines of the proof of theorem I3.2| the relation Yl 2 g < oo is used to show that the middle 
term of equation (|3.2|) is small when L\ is large enough. 

3.2. The upper bound for T*. In order for t[ F ' W ' to be meaningful the window sequence {W„} 
must grow slowly enough to detect a too-soon return. What slowly means here is that for every h 
there exist window sets of size approximately 2 h ' Fk ^ for large enough k. One condition which ensures 
this without reference to the sequence {F n } is that I — > 1. With this condition in place, one 

can deduce the bound for T* from the bound for U*: 

Lemma 3.4. Suppose {W n } is an increasing sequence such that ^ — >• 1. For x E fl, if 

U*(x) < h then T*(x) < h. 

Proof. If Tfc(aj) = I ^JEfe (a;) I > 2( h+£ ^ Fk \ for infinitely many k, then the growth condition implies that 
|Wii fe (a;)-i| > 2^ +£//2 -" i?fc ' for infinitely many k. But for such a k the only occurence of x(Fk) in 



Return times, recurrence densities and entropy 25 

Wji k f x \_x is the original pattern, so \U k ^ Rk ^ x -j_ 1 (x)\ < 2~( h+£ l 2 ^ Fk \ for infinitely many k, which is 
impossible because U*(x) < h. □ 

Theorem 3.5. Let {W n } be a quasi-filling sequence satisfying -^^Tpp^ • — ¥ 1- Then for any tempered 
F0lner sequence {F n }, Tf pw ^ < h almost surely. 

Proof. Let e > 0. Define 

n(k) = minjn : \W n \ > 2 ( - h+£ ^ Fk ^ . 
For a > we have, since {W n } is quasi-filling, that 



Cn(Jfe) Cn(fc) Cn(fc) 



and this now gives, via theorem I3.31 that 

lim sup — J log U&yj (x) < h 

k \ r k\ 

almost surely. This in turn implies that T* < h almost surely, by reasoning like that of the lemma 
E3l above. □ 



4. Lower bounds on return times and densities 

In this section we prove the bounds T* > h and [/* > h. The bound T* > h is proved by 
constructing a code for the process which beats entropy if T* ^ h. This is close in spirit to the 
proof of T* > h originally given by Ornstein and Weiss for the cases G = Z and G = Z d , but some 
new combinatorial machinery is needed to make the construction possible for general groups and 
general window sets. We first give a brief description of the coding ideas we will need, and then 
explain the combinatorics and detailed construction of the code. We conclude with the proof of the 
bound U* > h, which relies both on the filling ideas used in the last section and on the lower bound 
% > h. 

4.1. Coding and entropy. For completeness we take a detour and discuss the connection between 
entropy and efficient coding of a process. 

A useful characterization of the entropy of a process is as the lower bound of all achievable coding 
rates for the process. An F n -code is a map c : S Fn — > {0, 1}* = U n >o{0, 1}™. Thus c encodes each 
pattern 93 : F n — > S in a binary string (=sequence) c((p), which is called the codeword associated 
with (p. If c is an injection the coding is said to be invertible or faithful. For x £ £1 and an F^-code 
c, we write c(x) instead of c{x{F n )). If c n is an F n -code, we say that {c n } is an {F n }-code. {c n } is 
invertsble if every c n is. 
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The number of bits (=symbols) in the codeword c(tp) of tp is called the length of the codeword 
c(ip) and is denoted by £(c(tp)). Note that the codeword length of different patterns may vary. For 
an F n -code c and process {X g }, the average number of bits per symbol used by the code is the 
quantity p(c) = r^-r J £(c(x))dP(x); this is the average number of bits used in the c-encoding of a 
random pattern x(F n ), normalized by the number of symbols in the -F n -pattern x(F n ). 

For r > 0, we say that r is an achievable coding rate for the process {X g } if there exists an 
invertable {-F n }-code {c n } such that limsup/9(c n ) < r. This means we can represent arbitrarily 
large patterns from the process using approximately r bits for every symbol of the pattern. The 
coding rate of the process {X g } is the infimum of achievable coding rates: 

R = inf {r > : r is an achievable coding rate for the process • 

It is not hard to see that the coding rate of a process is the same as the process entropy h 
(incidentally, this shows that R does not depend on the sequence {F n }). To see that R> h, let {c n } 
be a sequence of invertable codes such that limsup,^^ p(c n ) < r. Then there is some e > such 
that for infinitely many n there is a measurable set A n C Q with P{A n ) > e, and j^£(c n {x)) < r 
for every x G A n . Since c n is an injection, and for every x G A n we have £(c n (x)) < r\F n \, we have 

{c n (x) : x G A n } C (J {0, l} k 

k<r\F n \ 

so every x £ A n belongs to at most one Ylk<r\F n \ 2 fe < 2 • 2 r l F ™l F n -atoms. The Shannon-McMillan 
theorem now implies that h < r. 

To see that h > R we formulate the following simple lemma, which enables us to turn a non- 
faithful code sequence into a faithful code sequence. 



Lemma 4.1. Suppose that {c n } is a sequence of F n - codes and & n C T, Fn is such that c„|$ n is an 
injection and j^£(c n ((p)) < r for every tp G <!> n . Assume also that P([$ n ]) — > 1. TTjen R < r. 

Proof. Let e n be an enumeration coding of S^™, ie it assigns to tp G a binary string representing 
the index of tp in some fixed enumeration of T, Fn . The code e n is faithful. Now define a code {c ra } 
by 

0c n (p) p G 
le n (<^) p i<& n 



Cn{f) = | 

{c n } is clearly an invertible code. Moreover, 
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SO, 

P(c n ) = 7^7 Jt(&n{x))dP 

< -^-+r-p([^ n }) + \io g m]p(n\[^ n }) 

\-fn\ 

— > r 

as desired. □ 

If we choose C Y< Fn in such a way that r^-r log I'&nl — * h and P([<J> n ]) — > 1, choose c n to be 
an enumeration coding of <£ n and define it arbitrarily of T, Fn \ <3? n , then the lemma implies that 
p(c n ) < h + e for every e > 0, so R < h. 

We conclude this section with a combinatorial lemma which will be used later. Its importance 
is that it enables us to describe small random subsets of F n at a low rate by using an enumeration 
code. 

Lemma 4.2. Let < A < 1/2 and F a finite set. The number of subsets of F whose size is at most 
X\F\ is bounded by 2 S( - X ^ F \ , where lim^o S(X) = 0. 

Proof. The number of such subsets is 

V (\ Fn \\ < 2 (- A1 °g A -( 1 - A ) 1 °g( 1 - A ))l i;, l 

m<X\F n \ V 7 

The bound follows from Stirling's formula; or see p. 52] for an elementary proof. □ 

4.2. The lower bound for T* (Part I). In this section we begin the proof of 

Theorem 4.3. If {F n } is a tempered sequence and {W n } is an increasing sequence of finite subsets 
of G, then 

rf^=liminf^logTfW>/ t 

almost surely. 

We will need some notation. 

Definition 4.4. Let {F n } be a fixed sequence. A cover v is a collection of sets of the form F n f for 
some n G N and / € G. The set E C G of /'s such that some F n f is represented in v is called the 
set of centers of v and is denoted by dom v. We also say then that v is a cover over E. 

It would be more precise to say that a cover v over E is a collection of pairs (n, /) € N x E, 
because we wish to allow more than one set F n f over each / £ E, and also since the set F n f does 
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not, in general, determine n and / uniquely. However, it is more convenient to think of v as a 
(multi)set with elements of the form F n f, and this is what we will do. 

We write Liu for the union of members of u. When we wish to specify the elements of u by name, 
we will write L) u F n f, \F n f\, etc. This is imprecise because the index n depends on / but we 
omit this when convenient. 

Fix {F n } and {lf„}. Let i £ SI, / 6 G and n G N, and suppose R n (fx) < oo. There then exists 
an element /' G G at which fx(F n ) repeats, and is closest to / with respect to {W„}. To be precise, 
there is an /' such that 

(1) fx G [fx(F n )] (ie the F n -pattern at / in x repeats in /'). 

(2) f'f- 1 G W Rn{fx) (ie the offset from / to /' is in W Rn(f(x)) ). 

In general there may be more than one /' with these properties. For fixed x and n, we assume some 
choice has been made, and denote this element by f x > n . 

For a set F n f such that R n (fx) < oo, we write (F n f) x = F n f n ' x . The (partial) map F n f i— > 
(F n f) x is called the displacement arising from x. Strictly speaking, the map F n f i— > (F n f) x is 
not well defined since the set F n f does not uniquely determine n and /. However, n and / will 
always be clear from the context so no confusion should arise. Finally, for a cover u such that 
R n {f x ) < oo for every F n f G u we write u x = {(F n f) x : F n f G u}. Thus v x is a cover over the set 
{f x ' n : F n f G u}. One should think of the displacement v x of u as the cover obtained from u by 
moving each set F n f G u to one of the nearest repetitions of the pattern induced on it by x. 

We will only be working with covers u and x G 0, such that R n {fx) < oo for every F n f G u, and 
avoid further mention of this fact. 

We can now state the idea behind the proof of theorem 14.31 Suppose by way of contradiction 
that P(T* < h — e) > p > 0. We will try to construct an F/v-code cat which beats entropy. By the 
ergodic theorem, for almost every x and large N at least a p-fraction of the / G -F/v are such that 
T n (fx) < 2^ h ~ £ ^ Fn ^ for some n = n(f) G N. In order to describe the pattern x(F n f) it is enough 
to describe the pattern x(F n f x ' n ) and the element f x ' n f~ 1 . Since the latter is in W Rn ^ x ) and this 
set is of size at most we can describe it in approximately (h — e)\F n \ bits. Thus if the 

pattern x{F n f x,n ) were known, the pattern x{F n f) could be coded in h — e bits per symbol. 

Let E C Fn be the set of / G Fjv for which there exists such an n{f) with R n ^(fx) < h — e. 
Let v = If it should happen that (a) the collection v is pairwise disjoint, (b) Uv is 

disjoint from Liu x , (c) we can code x(Fn \ Liu) at, say h + pe/2 bits per symbol, and (d) Liv is at 
least a p-fraction of Fn, then the discussion above shows that we can code x(VJv) at around h — e 
bits per symbol. Given that the above hold, we can then code x{F^) at a rate of h — pe/2 bits per 
symbol. 
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Conditions (c) and (d) are not to difficult: (d) would follow from (a) and the fact that E is a 
p- fraction of F/v, and (c) can be achieved because Fjy \ Uv can be mostly covered by entropy- typical 
sets when N is large, and a standard argument then gives that x(Fn \ Uv) can be coded at a little 
more than h bits per symbol. The difficulty in realizing this outline is that in (a) and (b) cannot in 
general be achieved. 

The proof of the return times theorem for Z and Z rf given in [H] avoids this problem by allowing 
the elements of u and v x to intersect arbitrarily, and employs a more elaborate coding scheme which 
uses the order structure of 7L d in an essential way. This method doesn't generalize to other groups. 

For our proof we will drop the disjointness requirements (a) and (b) and instead require only 
"almost disjointness". To be precise, 

Definition 4.5. Let e > 0. A sequence {Hi}i^j of finite subsets of G are said to be e-disjoint if 
there exist subsets H[ C Hi such that {H'^i^i are pairwise disjoint and \H[\ > (1 — e)\Hi\. We say 
that a cover v is e-disjoint if the collection {F n f € v} is e-disjoint. 

If {HiY i=1 has the property that for each i < I, \Hi n (Uj^Hj)] < e\Hi\ then the collection is 
e-disjoint; simply set H[ = Hi\ Uj^Hj. 

An important property of e-disjoint collections is that the size of their union is almost the sum 
of their sizes. To be precise, let {Hi}i e i be e-disjoint and H[ C Hi as in the definition. Then we 
have 

U H t \ > i u = E i^i ^ ( x - £ ) E 

iei iei iei iei 

Clearly for two e-disjoint collections {Di} and {Ej}, if DidEj = for every i, j then {Di}U{Ej} 
is an e-disjoint collection; and so for any funion of pairwise disjoint e-disjoint collections. 

Definition 4.6. For x G $7 and a cover v = {F n ^fiY i _ 1 over E = . . ,//}, we say that the 
pair (u, v x ) are e-disjoint if for every 1 < i < I, 



j<i 



< e\F n (£)fi\. 



According to the discussion before the definition, if {v, v x ) is e-disjoint then v is e-disjoint. Note 
that in order for the statement that {v,v x ) is e-disjoint to be meaningful, the set of centers of u 
must be linearly ordered. 

The point of this definitions is the following: 

Lemma 4.7. Let 0<e<^,x£Q and suppose v = {F n ^fiY i _ 1 is a cover over E = {/i, . . . , //} 
such that (v,v x ) is E-disjoint. Then there exists a set A C [Ju satisfying \A\ < 2e| U v\ and such 
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that, given the patterns x(Ui/ x \ Uv) and x(A), and given the map fi i— > / x > n M f we can reconstruct 
the pattern x(\Jiy). 

Proof. Set 

Hi = F n (i)fi \ \j(F n (j)fjT- 

j<i 

Since (V, v x ) is e-disjoint, \Hi\ > (1 — e)\F n ^\, and for j > i we have (F n ^fi) x PI ilj = 0. Define 

7 

4 = (Ui/) \ U 
1=1 

Using the fact that v is e-disjoint (since {y, v x ) is) , 

\A\ <£^|F„ (i) /;| < Y37' U ^I < 2£ |Uf|- 

It remains to show that given x(A), x(Uu x \ Uu) and the map / i— > f x > n (f) we can deduce x(}Jv). 

We claim that for any i if we know x{\Jj < iF n ^fj) then we can deduce x(F n ^fi); this will 
complete the proof. Since f*' n ^ x(F n n\) = fix(F n ^), and we know fi and f^ ,n ^\ it suffices to show 
that we can deduce x((F n ^fi) x ). Now since by assumption we already know x^j^F^j^fj), and 
since F n{i) fi n {F n ^fi) x = 0, all we need do is find x((F n ^fi) x n \J j:>i F n ^fj). But this is known, 
since {F n ^fi) x n (Jj>j ^nO')/j ^ A and we know X ( A )- D 

In order to use the lemma 14.71 to produce good codes we need to know how to produce large 
covers v such that (v, v x ) is e-disjoint. We address this next. 

4.3. Disjointification lemmas. Our methods in this section follow those in 
We first need some more notation. If u, fx are covers, their join is 

v V fi = {F n f : F n f £v or F n f G fi} 

(we use this notation instead of the more natural v U fj, to avoid confusion with the set Uf defined 
above in 14. 4|) . The join of several covers u\, . . . , v\. is denoted by Vi=i v i- 

The restriction of a cover v over E to a subset of -E' C E is denoted by v\e' = {F n f G v : f G -E'}. 
We say that ^ is a subcover of v if fi Q v, ie if -F n / € ^ whenever F n f G /x. Thus if i/ is a cover over 
73 and E' C 75 we have C i/. 

We also write minf = min{n : F n f G i/}, and similarly maxiA 

Definition 4.8. Let {-F n } be a F0lner sequence. The blowup of F n is = (Uk< n Fr 1 )F n . For a 
cover u, write U + z^ = U u F+f. 

The point of this is that if F^f and -F n g are two translates with k < n, then / ^ -E+g iff 
Fkfr\F n g = 0. Thus if F^f is a set and v some cover, and A; < mini;, then / ^ U + z^ iff i 7 ) c /n(U^) = 0. 
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For a F0mer sequence {F n }, the condition \F+\ < C\F n \ is equivalent to saying that {F n } is 
tempered with constant C. 

The following properties will be useful. They state that if {F n } is tempered then the size of U + u, 
Li(uV u x ) and U + (u V v x ) are at most a constant times the size of Uv, assuming some disjointness 
criteria from v or (is, u x ). 



Lemma 4.9. Suppose {F n } is tempered with constant C. If v is an e-disjoint cover then | U + v\ < 
U v\. 

1— £ 1 1 

Proof. We have 

i u + v\ = i u F+f\ < \ p nf\ < \ p nf\ < T^i U V \ 

V V V 

(the last inequality follows from the fact that v is e-disjoint). □ 

Lemma 4.10. // (v, u x ) is e-disjoint then \ U (y V v x )\ < -^ \ [Jv\. If in addition {F n } is tempered 
with constant C then \ U + (v V v x )\ < \ U v\. 

Proof. If (u, v x ) is e-disjoint then it is clear from the definitions that v is e-disjoint. We therefore 
have 

| U (v V v x )\ < Y,i\Fnf\ + \{F n ff\) = 2j2\Fnf\ < j^-J U v\. 



If {F n } is tempered with constant C, 

U+ (v \Jv x )\< 2j2\Fjf\ < 2Cj2\Fnf\ < — £ \ U v\. □ 



A cover v over E is constant if for some no every member of v is of the form F no f. We then say 
that v is an no-cover. For any cover u, we can write v as v = Vi=i u i where each Ui is a constant 
i-cover. We say that v; L is the i-th level of v, and that {^} is the decomposition of v into levels. 

Finally, we say that a cover v over E is simple if for each / <G E there is exactly one n such that 
F n f G v - Any cover v over E has a simple sub cover over E. 

The first step in proving that large covers v exist such that (u, u x ) is e-disjoint is the following 
lemma. 

Lemma 4.11. Let < e < \ and x £ £1. Let u be a cover over a set E. Then there exists a simple 
subcover \i C v over a set E' = {/i, . . . , //} C E such that (u, fi x ) is e-disjoint and | U yu| > 8+ e 2C |-^|- 

Proof. With regards the requirement that /i be simple, we merely note that we can replace v with 
a simple subcover of v and proceed from there. 

We first prove the lemma under the assumption that v is constant and then proceed to the general 
case. 
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Suppose v is a constant ra-cover over B. We will show that there exists E' C B such that for 
/i = v\e> we have (/i, is e-disjoint and | U /i\ > §|B|. Let E' = {/i, . . . , //} C B be a maximal set 
such that for /i = v\e', H x ) is e-disjoint. Set D = U(/i V fi x ). If |D n E\ > 4|B7| then by lemma 
ETTU1 

, 1 — e, , 1, , 1, 1 , 

U/i > — — LD > ~LD > -pnfi > - LB 
2 1 1 ~ 4' 1 ~ 4' 1 8 

so we are done. Otherwise \D n B| < ^|B|. Then by minimality, for each / G E \ D it must hold 
that |B n / H B>| > e|B n |. Since each g & D can lie in at most |B n | right translates of F n , we have 
\D\>-Jh- e\D \ E\ ■ \F n \ > %\E\, so 

U a > LD > ^ — LB > - \E . 

Returning to the case where the v is not constant, let v = Vj=i v % De the levels of v. Since v is 
simple, the set of centers of Vi and Vj are disjoint fro i ^ j. We define inductively for i = I down to 
i = 1 subcovers [ii of v, L over sets E[ = {fn, . . . ,fiM(i)} such that (fii,fif) is e-disjoint, as follows: 
Assume we have defined /ij for j > i and define 

v[ = {Fif G Vi : Fif n (U0j V ^)) = for every j > i] . 

This is equivalent to saying that v[ = Vi\E i , where 

Ei = dom^, \ y (u+(/^ V • 

Now apply the one-level case proved above to v[ to obtain a subcover /Xj which is the restriction of 
Vi to B| = {fn, fiM(i)} C Bj. 

Finally, set B' = UB^, and order E' as in 

E> = {fi}ill = {fn, ■ • • ! fiM(i),- ■ ■ ! /n> • • • > /lM(l)} 

(note that the B^ are pairwise disjoint), and define fi = U//j = v\e>- 

It is easy to verify from the construction that (//,// x ) is e-disjoint. We must verify that U/i is 
large enough. Write D = U + (fi V fi x ) and note that B, D domfj \ D so 

UiBi D (Ui dorn^,) \D = E\D. 

Hence 

i u = y~j u //ii 
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which by the definition of [ii and the one-level case proved at the beginning implies 



l 



umI > oEl^ 



i=l 
1 



> fiy* 



i=l 




the last inequality since by lemma 14.101 we have \D\ < U After rearranging this is 




□ 



Next, we want to get rid of the e in the constant on the right hand side of the last inequality. For 
this, the idea is to apply the last lemma repeatedly to several "layers" of covers. If we arrange that 
the covers are invariant enough with respect to each other, we can at each stage obtain a subcover 
with size almost a fixed fraction of the size of what remains after the last stage, and repeating this 
enough times we will get a large fraction of E. 

Definition 4.12. For fixed {F n }, e > 0, if m < n are integers we write m -< e n if for all n' > n 

|(Uj< m i^~ )F n i\ < (1 + e)|F n /|. For covers v\,V2 we write v\ -< £ vi if max 1/1 -< e min^2. 

It is easy to verify that the relation -< e is transitive. As an example of the relation ~< e , note that 
if {F n } is (1 + e)-tempered, then i -< e (i + 1) for all i. Also note that for any F0lner sequence {F n } 
and for any m and e, m -< e n is true for large enough n. 

Definition 4.13. For n £ N and e > 0, let 



For a cover v let U £ u = U{F^ e) f : F n f £ u}. 

By definition of the -< £ relation, \Fn \ < (1 + e)|-F n |. The following lemmas are proved like 14.91 



Lemma 4.14. If v is an e-disjoint cover, then \ U e v\ < jzr\ U u\, and if (v,v x ) is e-disjoint then 




( U F ^) F n- 



i:i^i E n 



and Eni 



\{U £ u) U {U £ u x )\ < 



2(l+e) 



Ui/L 



l-e 
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Proposition 4.15. Let < e < \ and x £ Vt. Then there exists an integer N such that for any 
sequence of N covers v\ -< E V2 ~<e ■ ■ ■ V N over E there a exists simple subcover u C \/ k=1 Vk such 
that (/j,, /j, x ) is e-disjoint and | U > g|-B|- 



Proof. We will describe a subcover fi C VfcLi v ki an d show that if N was large enough to begin 
with, the desired properties hold. 

The proof is similar to lemma 14.111 Define inductively, from i = N down to i = 1, subcovers jii 
of Vi defined over sets E[ = {fa, . . . , fiM(i)} ^ B as follows: Assuming we have defined /%-,B'- for 
j > i, set 

Ei = E\{J (dom/ij U (U e (^- V uj)) 

and v'i = Vi\Ei- Hi and E[ are obtained by and applying lemma ll.lll to v[. Finally, define E' = Uj<jB^ 
and order E' as in lemma ll.lll 

E' = {/i}j=l = {fill ■ ■ ■ , flM(I), i /ll; > /lM(l)} 

(as before the E[ are pairwise disjoint). Set u = Ufij It is easy to verify from the construction that 
\i is simple and {h,h x ) is e-disjoint. We now show that | U yu| > \\E\. 

Set D = U e (^V fj, x ). Then \D\ < \ U/i|. Also, clearly |dom^| < |U/z| because \i is e-disjoint. 

Thus for each % = 1, . . . , N, 

\Ei\ > \E\-\D\-\domfJk\ 

> W -(»fii4 + 1 )|u„ 

> LB Uu 

1 — e 



so by lemma 14.111 



N 

Ufi\ = 5^|UA*i|> 



i=l 
iV 

> y^— ib, 

- ^8 + 20' 

iV 

- ^8 + 2C u 1 l-e 1 M7 

eA r ,. _,. 3 — e. 

> -(LB Uu) 

~ 8 + 2C vl 1 l-e 1 Pl7 



3-e, 
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and after rearranging this is 

for N large enough this is greater than h\E\ □ 



We will also need the following lemma, which can be proved by the same techniques as above. It 
was first proved by E. Lindenstrauss in pj, or see jllj for a different approach more similar to ours. 

Lemma 4.16. Let < e < \. Then there exists an integer N such that for any sequence of N 
covers v\ < e v 2 -< £ . ■ ■ -< £ vn over E, there exists a simple subcover v C Vi=i v i such that v is 
e-disjoint and | U v\ > (1 — 2e)|2£|. 

We end this section with a combinatorial result akin to lemma W7H 

Lemma 4.17. (Number of e-disjoint covers) Let L < N and < e < \. Then the number of 
simple e-disjoint covers v with centers in Fn, minzv > L and Uz/ C Fn is at most 2 p ^- £,l ^ Fn \ where 
lim^^oo p(e, L) = 0, uniformly in e. 

Proof. Each cover v = {F n ^fiY- of the type we are interested in is determined by its centers 
{/lj ■ • • ; fi} and the values of n(i) for i = 1, . . . , I. If v is e-disjoint and \Jv C F/v, we have 

I 

\F N \ > | U u\ > (1 - e) \Fn{i)fi\ >(l-e)-I-L 

i=l 

since \F n f\ > n and minz/ > L. Thus I < rxz^YE l-^vl- Since we only consider simple covers, the 
number of possible sets of centers for such e-disjoint covers v with centers in Fn is bounded by the 
number of subsets of size at most ^ 1 _ 1 £ ^ aL l-F/vl of \F^\, which by lemma 11^1 is bounded above by 

2 l5 ( (l-'e)z, )l F ivl i 

We now calculate how many e-disjoint covers v exist which satisfy our assumptions and have 
domi/ = E for some fixed set E. We use a coding argument: Given v = {F n (^fi}. as above, we 
write down the values n{i) in binary notation in some pre-determined order (eg fix an order on G 
and write n(i) down according to the order of the fi). In order to decode the resulting concatenated 
sequence of Os and Is we must insert "punctuation" into the resulting concatenation of numbers. 
We therefore code each binary digit as 00 and each 1 as 11, and terminate each number with the 
pair 01. Using this scheme, we can clearly recover v from E and the codeword. In order to code 
n{i) in this manner we use 21ogn(z) + 2 bits. Recalling that we always assume \F n \ > n, the length 
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of the resulting codeword is 

/ I 
^(21ogn(i) + 2) < ^(2 + 21og|if n(<) / i 



i=l i=l 



< 2-/ + 2^|F, 



n(i)\ 



.loe|-F„(i)| 



F„ 



i = l I 

<- 7T^T7™+ 2 '^X>»<.>/. 



2 


(1 


-e)L 




2 


(1 


-e)L 




2 


(1 


-e)L 



i=l 

21ogL |UF n(i) /i| 



L 1-e 



21ogL , , 

/ 2 i log L \\p 

Thus there are at most 2 ( 1 - £ ) L ) (i-e)^' 1 JVI such covers over a fixed set E. 

Putting this all together, we see that the number of e-disjoint covers v with mmv > L and 
Uv C Fjy is at most 

-( ln2 '(ra +i( (r4i )+ (r^)^ 

which proves the lemma. □ 



4.4. Proof of the lower bound for T* (Part II). 

Proof, (of theoremOl) Fix {F„},{W„} and the (X g ) G , defined on (O, T, P). Suppose P(rj: F ' W) < 
h — e) > p > 0. We are going to use lemma 1%. 151 to construct a sequence {c„} of F n -codes such that 
eventually almost surely x{F n ) can be reconstructed from c n (x), and 

^r-Ac n {x))<h- P -e. 
\F n \ 5 

This would contradict the fact that entropy is a lower bound for the achievable coding rates (section 
14. ip . since we can proceed as in lemma UTTl and turn {c n } into a faithful code with the same rate. 

Fix a r] > and a series {3>n} of sets of entropy-typical Fn-patterns, <I> n C Y, Fn , such that for 
every ^6$, 

2 -(h+ri)\F n \ < pfl^]) < 2 -( /w ?)l F "l 

and x € [3> n ] eventually almost surely. For x € f2 and iV fixed, consider the pattern x(Fn); the 
codeword ctv(x) depends only on this pattern, and is constructed as follows: 



Return times, recurrence densities and entropy 37 

Step 1: Consider the sets F^f C Fn such that T^{fx) < 2^ h ~ e / 2 ^ Fk \ and from this collection 
try to extract a cover fi such that (/J,,fj, x ) is //-disjoint and very large. To be precise, we 
require that | U fj,\ > fl-F/vl- However, we try to choose \i so that the next step is possible: 

Step 2: Consider the sets F^f C Fn for which fx £ [$k]i an d from this collection try to 
extract an 77-disjoint cover v which is also disjoint from U/i. We choose v as large as 
possible; we require that Uv and U/i together cover all but a 3rj- fraction of Fjv. 

Step 3: If we cannot find /i, u as in steps 1,2 with the required sizes, we define cn(x) to be 
the empty word. 

Step 4: Otherwise, let A\ be the set guarenteed by lemma ET71 when applied to /i, so A\ C U/i 
and I Ai| < 2rj\ U /i| < 27/l-F/vl, and has the property that if pattern x\{A U (U/i x )) is 
known and if the elements f x,k are known for every F^f £ /z then x(U/u) can be deduced. 
Also, let A 2 = F N \ ((U/i) U (Ui/)). 

Step 5: Now define (assuming steps 1 and 2 were successful) c n (x) = 717273, where 

• 71 encodes the pattern x{A\ U A2). 

• 72 encodes the pattern x(U^). 

• 73 encodes the cover /i along with the elements f x,k for each Fj~f £ /i. 

It is clear from the definition of A\ that if steps 1,2 succeeded then we can reconstruct x(F^) from 
cn(x). It remains to show that for almost every x, steps 1 and 2 will succeed for large enough N, 
and that in this case the encoding, as described in step 5, can be carried out in such a way that 
£(c N (x)) < (h-p/10)\F N \. 

We begin by showing that for large enough N, steps 1 and 2 will succeed. Fix integers M, L to 
be determined later, and select a sequence of intervals {[aj;&i]}f=i such that a\ = M, hi -< v a-i + i 
and 

(1) For a set of x £ £1 with probability greater than p there exist indices rrii(x) G [a^ ; 6^] such 
that T mi ( x }(x) < h — e for each i. 

(2) For a set of x G ft with probability greater than 1 — 7/ there exist indices rii(x) £ [a^; bj\ such 
that x £ $ ni (a;) f° r each i. 

Such a sequence is easy to construct using the ergodic theorem, the fact that P(T* < h — e) > p, 
and our choice of {•I'n}; we omit the details. 

From the pointwise ergodic theorem and the asymptotic invariance of {F n }, for almost every 
x £ f2 it holds for large enough N that 

for L + 1 < i < 2L there exist indices 
/ £ F N mi(f) £ [a;; &»] with T mi{f) (fx) <h-e ) > p 
F mi{f) f C ^ and (F m , l{f) fY C F w 
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for 1 <i < L there exist indices 
( 1.2) ^# I f G F N ni (f) G [a % - h) with } > 1 - ;/. 

fx G $ ni(/) and F ni (/)/ C ify 

For x, A" such that the above holds, let rrii(f) and rii(f) be as in (|4,1[) and 1)4.2(1 . 

Let i?i C Fn be the set described in 14.11 and Hi = \Fmi{f)f} / e Bi' so we nave covers A*i over ^l- 
Note that li% H2 -<<q . . . -<n PL by our choice of [a^; so assuming L was chosen large enough 
we can apply lemma 14.161 to . . . , /x^ and obtain a simple subcover fi C \/ i=1 /ij over some set 
£^ = {/i, . . . , //} C i^x such that (/j, [i x ) is ^-disjoint and | U p\ > ||-Ei|, which means that 

Let £^2 ^ -Pjv be the set described in 14.21 and t'j = \Fm(f)ff t eE2 i so we have covers z/j over 
As with the m, we have v\ -< v v 2 -K v . . . ul, and furthermore vl -<n [i- Let v[ be the restriction 
of Vi to E2\U' n n, so each member set of i/ is disjoint from U/i. Assuming L is large enough we apply 
lemma 14.161 and obtain an 7?-disjoint simple subcover u C \/i=i v 'i over some set E' 2 C £J 2 \ (U^/u) 
such that | U i/| > (1 - 2t?)|# 2 \ (U»|. We then have 

| U i/| > (1 - 2t?) (\E 2 \ - | U" /x|) > (1 - 277) f (1 - r?)|F w | - i±^| U M | 



1—7? 

By construction v are disjoint, and putting the last two paragraphs together, we have 

| u (mVz/)| = | U(J,\ + | Ui/| > 
> (1 - (1 - 2r/)i±^)| U ^1 + (1 - 2r/)(l - r?)^ > (1 - 3»7)|*V|. 

1—7/ 

Thus all but a 37?- fraction of Fn is covered by /lUy, and we have shown that for almost every x, 
for large enough N there exist fj,, v as required by steps 1 and 2 of the coding construction. 

We complete the proof by giving a more detailed description of the encoding of cn(x) (in the 
case where steps 1,2 succeeded) by specifying exactly how to construct 71,72,73, and show that the 
codeword length is indeed bounded by (h — p/5)\Fn\- 

Recall the functions 5(-) and />(•, •) defined in lemmas 14.21 and 14. 171 respectively. In the estimates 
below we ignore rounding errors; the reader may verify that these may be taken into account without 
disrupting the proof. 

Fix x, N so that steps 1,2 of the construction succeeded. Let /i, u, A\, A 2 be as in the construction. 

Write H = A\ U A 2 . The word 71 will first encode the set H and then the values x(f) for / G H. 
The former takes 5(57?)^^! bits, because by lemma l4~2*l we can costruct a list of all 2 s ( 5r, )\ F n\ subsets 
of F n with size at most 57?|.F n | and then describe H by giving its index in the list. Next, using some 
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fixed ordering of G (which induces an order on H), we write down the symbols x(f) for / G H. 
For this we need an additional |if|logE < (5?? log S) |i*jv| bits. So all together x(H) can be coded 
in (5(577) +5rjlogE)\F N \ bits. 

The word 72 will be coded by specifying first the cover v and then the values x(Fkf) for F^f G v. 
The first task is accomplished using p(r], M)|i*V| bits by lemma [4.171 Now for each F^f G v, 
according to some fixed order, we record the pattern x(Fkf) by giving the index of the pattern 
x(Fkf) in This requires (h + r])\Fk\ bits by our choice of {<3? n }, because |$fc| < 2( ft+?? )l Ffe l, and 
because by the definition of v, fx G whenever F^f G v. Thus to describe all the patterns 
x{Fkf) for F^f G v, we need 



£> + r/)|F fc /| 



l**/l<£^W| 

1 — 7] 



bits. All together, we see that we can code 72 with at most 79(77, M)\Fn\ + \ U i/| bits. 

The word 73 is coded by first describing p, and then, for each F^f G /j, we describe f x > k . The 
former is again achieved with 79(77, M)\Fpf \ bits (the same calculation as above applies), while the 
latter requires at most (h — e^F^r*-)] bits for each F^f G p, by definition of f x ' k . Note that this is 
where we use the assumption that {W n } is increasing: for if it were not, then in order to record 
jx,k we -would need to specify also which set W n it is from. In other words, we would need to record 
Rk(fx), and this could be expensive (there is no upper bound on Rk(fx)). However, since {W n } 
is increasing, we can assume that f x,k comes from the largest set W n satisfying \W n \ < 2^ h ~ e " Fk ^). 
Thus estimating as we did for 72 we find that 73 can be coded in 

p(ri,M)\F N \ + {h- e)Y,\Fkf\ < p(v,M)\F N \ + ^| U M | 

bits. 

Putting all this together, we find 

£(c N (x)) = £(71) +^(72) +^(73) 

' (5(777) + 5?7 log S) \F N \+ 



1 



V 



< 



< 



+ l p (r,,M)\F N \ + !£ 



h+r)\ 



V 1 
h—e I 



U v\ 



+ lp( v ,M)\F N \ + ^\Up 



(5(7rj) + 5r? log S + 2p(r], M)) \F N \ + 



+^|(Ui/)U(U M )| 



< (h + T(rj,M))\F N \ 



77 p 



\Fn\ 
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where t(t], M) -t0asM->oo uniformly in r\. So for M large enough and w small enough we have 

£(c N (x))<(h-^e)\F N \. 
This completes the proof. □ 



4.5. The lower bound for [/*. In this section we prove that 

ui F ' W \x) = hminf M--L\og(u( F n W \x) - -!-) > h 

k^oo n \F k \ K ' n \W n \ 

almost surely if {Fn},}^} satisfy the conditions of theorem ll.lll 

As was already mentioned in the introduction, the correction term — Tyy-t in the definition of U* 
is necessary because Uk, n counts the central pattern x(F^); without it, if for instance Iq € Wi, we 
would have for every k that U^i > m^n, and this implies £/* = for any process. 

In the previous section, we defined the quantities Vk <n and V* , and showed that U* = V* . We 
may analogously define 



Vif = lim inf inf — - — - log( Vj s 



"* « |^|" ov ' fe,n \W n \' 

as a version of f7* which counts all repetitions of x{Fk) in x(W n ), but we cannot, as was the case 
with V* , show that U* = V*, because when the correction factor is taken into account, if we compare 
n — y^y with Vk in — j^q as in equation |3JJ we find that all we can say is 

Uk,n ~ i,„ i < Vhn — . ; < |-Ffc| 2 (^fe.n ~ i TT7 i ) + L T ^ i 

\w n \ \w n \ \w n y \w n \ 

and this gives us {/* > K but not the reverse inequality. 

Furthermore, the fact that Uf. n does not count repetitions of x(-Ffc) which intersect F^ is necessary 
for our proof that f7* > /i; the proof cannot be adapted to the case of V*. The importance of the 
requirement that Uk n not count the repetition which intersect the original pattern stems from the 
following observation, which is used in an essential way in the proof: For almost every x, for large 
enough k, U)~ n (x) = unless \W n \ > 2^ h ~ £ " Fk K This follows from the fact that if U^ n {x) > 1 then 
there is within W n a repetition of x{F^) disjoint from F^, and so Rk{x) < n, which in turn implies 
Tk(x) < \W n \ (we will assume that {Wi} is increasing); now use the fact that for large enough k, by 
theorem 14.31 T)~(x) > 2 ( - h ~ £ ^ Fk K In the case of Vk >n , if Vfc jTl > 1 it may be because of a repetition of 
x(-Ffc) which intersects F^, and this argument fails. Thus we can't allow repetitions which intersect 
Ffc to be counted in Uk t n because Rk was defined as the first repetition of x(Fk) disjoint from F^. If, 
however, we were able to prove an analogue of T* > h (theorem l4.3|) for the case where Rk counts all 
repetitions in W n of x(F^), including those intersecting Fk, then the proof below would work just as 
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well for V*. This version of theorem 14.31 is true, for instance, in the case of W n = [—k n ;k n ] d C 
This was proved in |Sj. 



One can, however, define 



V k , n (x) = tttt 7 max I \E\ 



\W n \ 



1 G G E C W n and if l G ^ f e E then 
F k nF k f = (b, and fxe[x(F k )} 



Then we have that 



1 



~~ iw i — ~~ it?7 i — \E k \ 2 {U k , n 



\W n \ ~ \W n \ ~ ' ' ' * W n 



and so = liminffc^oo inf n -tJh log(t4 in (x) - = £/*. 

Our proof that U* > h is based on the bound T* > h. It is not hard to see that ui F ' W) > h 
implies T# W ' > h. We would like to show the reverse, namely, that U* F ' W ' < h implies T* < h. 
The idea behind the proof is that from the relation U k ^ n (x) — r^i > 2~^ h ~ e ^ Fk \ which means that 
there are "too many" repetitions of x{F k ) in W n , we can sometimes deduce that many of the points 
g € W n at which x(F k ) repeats are such that another repetition of the same pattern appears "too 
close" to g. If we could show this, we can appeal to the fact that > h to obtain a contradiction. 
One problem here is that the window sets W n may be unsuitable for capturing the idea of "too 
close". This will happen if the sequence {Wn} grows too quickly, and in this case we will not be 
able to obtain a contradiction through t[ F ' . We therefore introduce a second window sequence 
{Yn,}, which will grow slowly enough that with respect to {5^}, a drop of T^ F ' Y ^ below entropy 
can be observed. We will show that for a nice enough sequence {^j, if with positive probability 
jj(f,w) ^ ^ _ £ - g ^ rue ^ t nen [i j s no t a l m ost surely true that T* F ' Y ^ > h, which is impossible. 

We recall the following definitions, which were given in the introduction: 

Definition. Let {F n },{W n } be sequences of finite subsets of G. An increasing sequence {Y n } of 
finite subsets of G is called an interpolation sequence for {F n }, {W n } if 

(1) {Y n } is filling and increasing . 

(2) If \W n \ > \Y m \ then \Y m W n \ < C\W n \ for some constant C. 

(3) For every pair of real numbers < a < (3, for every large enough n there is an index k such 
that 2 a \ F "\ < \Y k \ < 2^1. 



The main lemma we will need about interpolation sequences is that too frequent recurrence of 
x(F k ) in x{W n ) implies that Tff (fx) is too small for many / 6 W n . 

Lemma 4.18. Let Y be 1/C -filling and E C W C G finite sets. Suppose that \YW\ < C\W\. If 
> a then 

1 ,r,_„ ^2/3 



{/ e E : \YfDE\>(3}\>a-C 2 



\W\" J 1 J Y 
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Proof. Let EL = {f E E : \Yf Pi E\ < /?}, and let {Y /i}f =1 be an incremental cover of EL with 
fi G E_. Then by the basic lemma 1X71 we have that 



\E-\ P 



since UiYfi C YVK we have 



so 



\UiYfi\- \Y\ 
|EL| < C-^\YW\ < C 2 -^\W\ 
f eE:mB> l3 ]: = ± ;m - ] E4)>a-0'±. □ 



\wy { \Y\ ' J 1 \W\ y ' 11 " - \Y : 

Corollary 4.19. Let < 5 < e/2, and let {Y n } be an interpolation sequence for {F n } ,{W n }. If 
for some k,n and x G Q, we have U^J^ {x) > 2^ h ~ e ^ Fk ^ , and if \W n \ > 2^~ <5 )I- Ffc , then if k is large 
enough, 



W n 



f£W n 



fx € [x(F k )} and 1 K-(h-e)\F k \ 
T^ Y \fx)<2^/ 2 ^ J 1 2 Z 



Proof. Set E = {f £ W n : fx € [x(Ffc)]} and (3 = \F k \ 2 . For k large enough there exists an m such 
that 

2 (h-2e/3)|JH < ly I < 2 ( h - £ / 2 )\ F k\ 

Since \W n \ > 2^)1^1, we have \Y m W n \ < C\W n \, so by the lemma (with a = 2~^ h ~ e )\ F k\) 

' '{f€E: \Y m fPE\ > \F£\} | > 



\W n 



> 2 -(h-e)\F k \ _ q1\ F \2 2 -{h-2e/-Z)\F k \ > ^ 2 -(h-e)\F k \ 

~ 2 

for k large enough. Now note that if for some / £ E we have \Y m f P\E\ > \F k \ 2 then at least one of 
the repetitions of fx(F k ) within fx(Y m ) is disjoint from fx(F k ), so T fc (F,y) (/x) < |y m | < 2( /l ~ <5 )l i; *l. 
This completes the proof. □ 

The second requirement we make of {W„} is that it be incompressible. We recall the definition: 

Definition. An increasing sequence {Vt 7 ™} of finite subsets of G with Iq € W n is said to be 
incompressible with constant C (or C-incompressible) if for any incremental sequence {W n [i)fi] : 
the number of the sets W n n\fi containing 1^ is at most C. 

An incompressible sequence {lf n } is filling; if {Wn} is incompressible with constant C then it 
will be filling with constant ^, since for a collection {W n ^fi}i & j as above, each g € Ui e jF n ^fi 
belongs to at most C of the sets W n n\fi. Incompressible sequences are superior to filling sequences 
because of the following observation: If {W^} is incompressible and {W n ^fi}{ =l is incremental, 
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then for any finite A C G, we have 

\An\Jw n{i) fi\>± \ Anw nwfi\- 

iei i<i<i 
This is because every a € A is counted at most C times in the sum on the right. 

The property of incompressibility is equivalent to the following property, which may be described 
as being filling relative to arbitrary subsets of G: 

Lemma 4.20. Let {W n } be incompressible, H\,H% C G and suppose { W^fe} 1 is an incremental 
sequence such that for each i = 1, . . . , I we have that 

\Hinw n(i) fi\ 
- \H 2 nw n{i) M ~ p 

Then 

a |gin(U^ w(i) /,)| 

c ~ |ff 2 n (UiW^/OI p 

( this remains true for a = or (3 = oo) . 

It is not difficult to see that if {W n } satisfies the conclusion of the lemma then it is incompressible. 
Proof. We show for instance the lower bound. 



|ff 2 n(iw nW /;)| " El^2nw n(i) /, 

(we use here the fact that {W n } is ^-filling) 

> i x ^ \H 2 nw n{i) fi\ |ffi n w ra(0 /j| > q 
"^YE, 1^2 n w nW /,| ' |ff 2 n w^/il " C 

Theorem 4.21. Zei F ra &e a tempered F0lner sequence, {X g } g£G an ergodic process with entropy h. 
Let {W n } be an incompressible sequence. If there exists an interpolation sequence for {F n } , {W„}. 
Then 

liminf inf- — — logUj: F ' W \x) > h. 

fe^oo n \F k \ lt ' n 

Proof. Suppose the theorem is false. Then there is a set B C CI with P(B) > p > such that for 
every x G B, 



limsup sup -7777 l°g ^/'^(x) <h — e 



1 

k— >oc n \Fk\ 

for some e > 0. Let {<!?«} be a sequence of sets of words, <3? ra C S^™, such that 

1 



log|$„| -► /i 
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and x G [3> n ] eventually almost surely. Fix L very large; we may assume that x G [3> n ] for every for 
n > L and x £ B. We may also assume that L was chosen large enough that |3>fcj < 2^ h+ ^ Fk ^ for 
fc > L. 

For every x £ B there exists an index k(x) > L and an index n(x) such that 

U k{x)Mx) {x) > 2-^-)l^)l 

and without loss of generality we may assume that for every x G B we have n(x) < M for some inte- 
ger M. We further assume, increasing L if necessary, that for every x £ B, |W n ( x )| > 2^ h ~ £ ^^ Fk< - x ^ , 
because T* F ' W ^ > h almost surely. 

Let x € fi be typical in the sense that the ergodic theorem holds for B and every L < k < M. 
By the ergodic theorem, we can choose iV large enough so that 

j^-r\{f€F N : fxeB}\>p. 

For / € F/v such that fx G £>, we will write for brevity &(/) = k(fx) and n(/) = n(fx). 
Step 1: Fix L < k < M and G and let 

H = {f € F N : fx €[<p]} , E = {f € H : fx € B , k{f) = k} . 

If / G E then U^^Q{fx) > 2~^ h ~ £ ^ Fk ^. Some of the points g G W n ^f at which 99 repeats will 
belong to E as well, and some won't. Write E + for the set of / G E for which "many" of the 
repetitions of <p in W n ^f are in E: 

E+ = { f£E '■ ^ W n(f)f^ E \> 2 ~ {h ~ £ ' 2)m } 

and 

E ~ = E \E+ = {f eE : ^|^ n(/) / n ^| <2-( /l - £ / 2 )l^l|. 

We are interested in the elements of Fn for which Tj^ ,Y ^ < 2~( h+£ / 4 ^ Fk \ . Let 

D = [f€H: if ' y) (/x) < 2 -(' 1 + £ / 2 )^l} . 
Now if / G E + , then by lemma 14.191 we have that 

\(Enw n(f) f)nD\ > ±\Enw n{f) f\. 
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Select an incremental cover {W^/j}!^ of E + with f\ € E + . For each i we have that the relative 
density of D n in Wn(/i)/j is at least half the density of E, so from lemma H.2UI we see that 

l,j |^n(uw n(/i) /OI -2C 

and therefore, since i? + C UW n m/j, we have 

(4.4) \EnD\>±\E+\. 

On the other hand, if / G E~ , we have that the number of elements of H in W n rf\f is at least 
2( £ / 2 )\ F k\ times the number of elements of E in W n tf)f, so 

\w n(f) f nD\> \\w nU) f nH\> I 2 ( £ / 2 )i^i • \e n w n(f) f\. 

Now select an incremental cover °f From lemma we have 

f45 ) \Dn(uw n{ft)ft )\ i (£/2)|Ffc| 

1 ■ ; \En(UW nm fi)\ ~ 2C 

and therefore, since E~ C UH^m/i, we have 

(4.6) |D|>2_ 2 ( £ / 2 )I^I|S-|. 

We now face two alternatives: 

(1) If IE 1 " 1 "! > \\E\ then equation POl gives us 

(2) Otherwise, if |-E^'~| > \ equation 14.61 gives 

\ D \ > ± 2 ( £ / 2 )l^l|^-| > J-2( £ / 2 )l F *l|£|. 
\ \ - 2C 1 - AC 11 

Step 2: For fixed k and </j € write D v , E^ for the sets D, E of step 1, thus making explicit the 
dependence on (p which was previously suppressed. Note that the collections {D^} , {E^} 
are pairwise disjoint. For each cp € &k we have by the above that either (a) lE^ n D {p \ > -^lE^] 
or (b) \D^\ > 3 ^2( £ / 2 )l Ffc l|^|. Let C be the set of 99's for which the first alternative holds, 
and its complement in There are again two alternatives: 

(1) I U^ £$ + EP\>\\ U^ k EV\. In this case 



I \J (EVnDV)\ > I J (£^nzF)| 
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= i^i U U 

(2) | U^ e$ - > i| U ve$fc EP\. In this case 

| (J £^| > | (J = 

= E izm^/^i 1^1 = 

> J_ 2 (e/2)|F fc |i I I E V\ y J_ 2 (em\F k \ l I I ™, 

- AC 1 U 1 " 8C 1 U 

Step 3: We now let k vary between L and M. Write = U </ , e $ fc £" / ' and D k = U !pe ^ k D ip . Note 
that 

E k = {f £ F N : fx e B and = k} 

D k = {f£F N : fx€ [* fc ] and rf > y) (/x) < 2^-^)1^!} 

so {E k } L<k<M are pahwise disjoint, but {D k } L<k<M need not be. We saw in step (2) that for each 
L < k < M, either (a) \E k DD k \ > ^\E k \ or (b) \D k \ > ^^^^ \E k \. Write 

J = jjfe : L < k < M and |D fc | > ^2 (£/2)|Ffcl j . 
Let fc € J. If 1^1 > 16C • 2-( £ / 2 )I f *=I|Fjv| we would have 

|A.|>^2( £ / 2 )i F '=i|ii; fc |>2|F w |. 

However, C (Ui,<fc<Af Wfc) \Fn\ and the latter set, if N is large enough, has size strictly less than 
2|Fjv|, so the inequality \D k \ > 2\Fjy\ is impossible. Thus for N large enough, if k G J it must be 
that \E k \ < 16C ■ 2-( £ / 2 )l F fcl|Fiv|, so 

J_i i i Ek \ = y \E k \ < ^ y 2-w\ F *\ < p 



for L large enough. Therefore, 



i^fi U °*i>j^iU^*i = i^|£i^*i2 

1 iv 1 L<k<M 1 JV 1 fc^J 1 JV 1 h(£J 



Return times, recurrence densities and entropy 



47 



1 1 k<£J 1 1 k£J 

= 1 I II E k \ rlll^fcl 

8CF N ] ^ fc| 8CFiv IU 

1 1 L<k<M 1 1 fee,/ 



> 



by assumption, ^ Uz,<fc<A/ s * 



> p, so 

p p p 



> 



8C 16C 16C 

But this is impossible, since it implies that at least a j^-fraction of / € -Fat are such that 

'^L<k<M T^' Y \fx) < 2( /l ~ £ / 2 )l- Ffc l Since L was arbitrarily large, this contradicts the fact that 

T* ^ > h a.s.. This completes the proof. □ 

In order to prove the theorem for {Wn} a quasi-incompressible sequence ( definition II. llty we need 
an analogue of lemma I4.2U1 This lemma is used in the proof in step (1) to justify equations 14.31 
and 14.51 Recalling what occurred there, we had fixed k and 92 E T> Fk and considered a set E C F^ 
of points / such that uj^Q(fx) < 2~( ft ~ e )l Ffc l Note that the right-hand side of this inequality 
depends only on k. Furthermore, one of our assumptions was that |W n r/)| > 2^ h ^ £ ^ 2 ^ Fk ^ for / € E. 
It is not hard to see that under these conditions a version of lemma 14.201 is valid, assuming that the 
W^-i-boundary of W n is small, in a manner depending on k, for all n such that |W n | > 2^ h ~ £ / 2 ^ Fk \ 
and this is just what quasi-incompressibility means. We omit the details. 

5. Open questions 

In conclusion we would like mention several problems which remain unresolved. 

The method of proof of the fact that T* > h made it necessary that in the definition of Rk we 
require that a repetition of the pattern x{F^) be "counted" only if it is disjoint from the original 
pattern. This definition affects also the definition of Uk,n and the results about £/*. In the case of 
G = Z d studied in jHj no such restriction was necessary. 

Question. If we define 

Rff (x) = inf{n : there exists 1q 7^ / G W n s.t fx S x(Fj t )} 
and Tk = | Wn k \ , when is it true that t^t log T& > h ? 

For the upper bounds T* < h and U* < h to hold, we saw that it is necessary to require some 
special properties of the window sets, eg that they be (quasi) filling. We have seen that such 
sequences exist in many cases, but we have no information about the case of non-locally- finite 
torsion groups: 
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Question. Do all groups possess (quasi) filling/incompressible sequences? What classes of groups 
have (quasi) filling/incompressible F0lner sequences? 

Another issue is, under what conditions, milder than those given, does the bound U* > h hold? 
For one thing, the dependence on an interpolation sequence would seem to be an artifact of the 
proof, and perhaps can be removed. It is also not impossible that the requirement that {W n } be 
incompressible might be weakened to being filling, or perhaps even completely done away with; the 
fact that > h for very general window sequences, and that we have not yet found a counterexample 
of the type in section I27T1 for the lower bound £/* > h, gives some hope that this might be true. 

Question. Can weaker conditions on the window sequence be found which ensure the bound U*>h? 
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